• MinnesotaGoddam@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    1 hour ago

    okay how many of these “delusional” people in the study are making fun of the LLM tho

    i don’t know because I don’t use the LLM i only see the screenshots. I am the control group. kinda. my nut is already off.

  • ExLisper@lemmy.curiana.net
    link
    fedilink
    English
    arrow-up
    27
    ·
    13 hours ago

    I think what we’re seeing is similar to lactose intolerance. Most people can handle it just fine but some people simply can’t digest it and get sick. The problem is there’s no way to determine who can handle AI and who can’t.

    When I’m reading about people developing AI delusions their experiences sound completely alien to me. I played with LLMs same as anyone and I never treated it as anything other than a tool that generates responses to my prompts. I never thought “wow, this thing feels so real”. Some people clearly have predisposition to jumping over the “it’s a tool” reaction straight to “it’s a conscious thing I can connect with”. I think next step should be developing a test that can predict how someone will react to it.

    • baaaaaah@hilariouschaos.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 hours ago

      Surprisingly, the people who have that issues with it aren’t the ones who contact to it emotionally, it’s the people who offload their decision making to AI

      It’s more like a codependence spiral than anything else

    • Tiresia@slrpnk.net
      link
      fedilink
      English
      arrow-up
      9
      ·
      11 hours ago

      Cults and toxic self-help literature have existed before LLMs copied them. I don’t know if LLMs are getting people who couldn’t have been gotten by human scammers.

      Scams have many different vectors and people can be vulnerable to them depending on their mood or position in life. Testing people on LLM intolerance would be more like testing them on their susceptibility to viruses.

      People can be immunocompromised for various reasons, temporarily or permanently, so as a society public hygiene standards (and the material conditions to produce them) are a lot more valuable. Wash your hands after interacting, keep public spaces clean, that sort of stuff.

      • ExLisper@lemmy.curiana.net
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 hours ago

        Yes, definitely can be a temporary thing which would make it even harder to protect people from. It’s also most likely some spectrum. If you’re “resistance” is at 10 you may not be at risk even at your lowest point. Other people can be at 5 when they are doing great but risk psychosis when they are down for some reason. I just think it’s kind of scary that people interact with it voluntarily (unlike with scammers or cults) without knowing how it will affect them. We all tried LLMs but most of us was lucky so far.

    • chunes@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      13 hours ago

      I have yet to see any evidence that AI is inducing problems. People with problems use it just like anyone else and others consider that use problematic.

  • Snot Flickerman@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    185
    ·
    21 hours ago

    Huge Study

    *Looks inside

    this latest study examined the chat logs of 19 real users of chatbots — primarily OpenAI’s ChatGPT — who reported experiencing psychological harm as a result of their chatbot use.

    Pretty small sample size despite being a large dataset that they pulled from, its still the dataset of just 19 people.

    AI sucks in a lot of ways sure, but this feels like fud.

    • XLE@piefed.social
      link
      fedilink
      English
      arrow-up
      47
      ·
      19 hours ago

      The hugeness is probably

      391, 562 messages across 4,761 different conversations

      That’s a lot of messages

        • Snot Flickerman@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          3
          ·
          2 hours ago

          …and about 82 messages per conversation. Also, at least half of all the messages are from the user to the AI, and the other half are from the AI to the user, meaning around 41 messages from the user per conversation.

          • lad@programming.dev
            link
            fedilink
            English
            arrow-up
            2
            ·
            47 minutes ago

            Yeah, I also thought about that, looks like a lot, but I guess users in this case differ from ordinary usage

      • braxy29@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        6 hours ago

        "We received chat logs directly from people who self-identified as having some psychological harm related to chatbot usage (e.g. they felt deluded) via an IRB-approved Qualtrics survey "

    • InternetCitizen2@lemmy.world
      link
      fedilink
      English
      arrow-up
      25
      ·
      21 hours ago

      I remember reading my old states book that said a minimum of 30 points needed for normal distribution. Also typically these small sets about proof of concept, so yeah you still got a point.

      • tburkhol@lemmy.world
        link
        fedilink
        English
        arrow-up
        33
        ·
        20 hours ago

        fud: Fear, Uncertainty and Doubt. A tactic for denigrating a thing, usually by implication of hypothetical or exaggerated harms, often in vague language that is either tautological or not falsifiable.

            • wonderingwanderer@sopuli.xyz
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              3 minutes ago

              They also use the words “the,” “at,” “is,” and “it,” but that doesn’t make it their jargon.

              We really need to stop condemning entire words just because some people we don’t like used them…

              I can’t tell you how many times I’ve been accused of using a “dogwhistle” because I used a totally innocuous word in accordance with its literal meaning without having any idea that it’s apparently been co-opted by some group of hate-filled extremists because I don’t follow those groups and I don’t know their lingo

              Like, soon we won’t have any words left that we’re still allowed to use. Language is already getting dumbed down, and I’m tired of walking on eggshells lest I say a word that could potentially be misinterpreted in light of a vague association to a different term that has a double-entendre that some niche circles use in some reprehensible way in their ostensibly secret code, or that I didn’t know was a euphemism…

      • porcoesphino@mander.xyz
        link
        fedilink
        English
        arrow-up
        11
        ·
        edit-2
        19 hours ago

        Where are you hearing it so much? (And ideally can you describe it in a little more detail than saying it’s crypto bros again?)

        • XLE@piefed.social
          link
          fedilink
          English
          arrow-up
          11
          ·
          edit-2
          3 hours ago

          Crypto bros are infamous for describing any criticism as FUD, no matter the criticism. It’s like a verbal tic. Here are some examples from the past couple days on the premiere Bitcoin social network:

          When all this FUD ends and Bitcoin goes 🚀

          Quantum FUD is at ATH

          FUD Busters [NFT]

          Flokicoin is built to last… Don’t follow the FUD.

          • Rekall Incorporated@piefed.social
            link
            fedilink
            English
            arrow-up
            5
            ·
            8 hours ago

            While I am aware that it’s a common crypto shill term, I think by this point crypto has fallen out of the mainstream, so their usage of terms doesn’t really matter.

            And as others have pointed out, the term FUD has been used at least since the birth of WWW/modern internet.

            • XLE@piefed.social
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              1 hour ago

              I have no argument there, the phrase was definitely not created by them, it’s just been beaten to death by them.

              They’ve also overused a bunch of ancient and unfunny memes well past their expiration dates, and universally adopted a collection of depressingly dull and incorrect slogans. “FUD” is just the one that has interesting meaning outside their sad sphere.

  • amgine@lemmy.world
    link
    fedilink
    English
    arrow-up
    37
    ·
    21 hours ago

    I have a friend that’s really taken to ChatGPT to the point where “the AI named itself so I call it by that name”. Our friend group has tried to discourage her from relying on it so much but I think that’s just caused her to hide it.

    • d00ery@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      10 hours ago

      I certainly enjoy talking to LLMs about work for example, asking things like “was my boss an arse to say x, y, z” as the LLM always seems to be on my side… Now it could be my boss is an arse, or it could be the LLM sucking up to me. Either way, because of the many examples I’ve read online, I take it with a pinch of salt.

      • frongt@lemmy.zip
        link
        fedilink
        English
        arrow-up
        9
        ·
        9 hours ago

        It’s definitely sucking up to you. It’s programmed to confirm what you say, because that means you keep using it.

        Consider how you phrase your questions. Try framing a scenario from the position of your boss, or ask “why was my boss right to say x, y, z”, and it’ll still agree with you despite the opposite position.

        If you’re just shooting the shit, consider doing it with a human being. Preferably in person, but there are plenty of random online chat groups too

      • Rekall Incorporated@piefed.social
        link
        fedilink
        English
        arrow-up
        1
        ·
        8 hours ago

        I use LLMs for work (low priority stuff to save time on search or things that I know I will be validate later in the process) and I can’t stand the writing style and the constant attempts to bring in adjacent unrelated topics (I’ve been able to tone down the cute language and bombastic delivery style in Gemini’s configuration).

        It’s like Excel trying chat with me when I am working with a pivot table or transforming data in PowerQuery.

  • givesomefucks@lemmy.world
    link
    fedilink
    English
    arrow-up
    40
    ·
    22 hours ago

    As the researchers wrote in a summary of their findings, the “most common sycophantic code” they identified was the propensity for chatbots to rephrase and extrapolate “something the user said to validate and affirm them, while telling them they are unique and that their thoughts or actions have grand implications.”

    There’s a certain irony in all the alright techbros really just wanting to be told they were “stunning and brave” this whole time.

    • Tiresia@slrpnk.net
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 hours ago

      Are the users in this study techbros?

      Besides, tech bros didn’t program this in, this is just an LLM getting stuck in the data patterns stolen from toxic self-help literature.

      For decades there has been a large self-help subculture who consume massive amounts of vacuous positive affirmation produced by humans. Now those vacuous affirmations are copied by the text copying machine with the same result and it’s treated as shocking.

      • very_well_lost@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        37 minutes ago

        Besides, tech bros didn’t program this in, this is just an LLM getting stuck in the data patterns stolen from toxic self-help literature.

        That’s not necessarily true. The AI’s output is obviously shaped by the training data, but much of it is also shaped by the prompt (and I don’t just mean your prompt as a user).

        When you interact with (for example) ChatGPT, your prompt gets merged into a much larger meta-prompt that you don’t get to see. This meta-prompt includes things like what tone the AI should use, how the AI should identify itself, how the AI should steer the conversation, what topics the AI should avoid, etc. All of that is under the control of the people designing these systems, and it’s trivially easy for them to adjust the way the AI behaves in order to, for example, maximize your engagement as a user.

    • A_norny_mousse@piefed.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      16 hours ago

      Huh. I hate it when people do that. Fake/professional empathy/support. Yet others gobble it up when a machine does that.

  • Hackworth@piefed.ca
    link
    fedilink
    English
    arrow-up
    14
    ·
    edit-2
    21 hours ago

    Anthropic has some similar findings, and they propose an architectural change (activation capping) that apparently helps keep the Assistant character away from dark traits (sometimes). But it hasn’t been implemented in any models, I assume because of the cost of scaling it up.

    • porcoesphino@mander.xyz
      link
      fedilink
      English
      arrow-up
      13
      ·
      edit-2
      19 hours ago

      When you talk to a large language model, you can think of yourself as talking to a character

      But who exactly is this Assistant? Perhaps surprisingly, even those of us shaping it don’t fully know

      Fuck me that’s some terrifying anthropomorphising for a stochastic parrot

      The study could also be summarised as “we trained our LLMs on biased data, then honed them to be useful, then chose some human qualities to map models to, and would you believe they align along a spectrum being useful assistants!?”. They built the thing to be that way then are shocked? Who reads this and is impressed besides the people that want another exponential growth investment?

      To be fair, I’m only about 1/3rd of the way through and struggling to continue reading it so I haven’t got to the interesting research but the intro is, I think, terrible

      • nymnympseudonym@piefed.social
        link
        fedilink
        English
        arrow-up
        6
        ·
        19 hours ago

        stochastic parrot

        A phrase that throws more heat than light.

        What they are predicting is not the next word they are predicting the next idea

        • porcoesphino@mander.xyz
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          6 hours ago

          How it functionally works, its the next word / token / chunk a lot more than its an “idea”. An idea is even rough to define

          The other relatively accurate analogy is a probabilistic database

          Neither work if you’ve fallen into anthropomorphising, but they’re relatively accurate to architecture and testing for people that aren’t too computer literate, far more than the anthropomorphising alternatives at least

        • kazerniel@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          7 hours ago

          throws more heat than light

          Thanks, I haven’t heard this phrase before, but it feels quite descriptive :)

        • ageedizzle@piefed.ca
          link
          fedilink
          English
          arrow-up
          10
          ·
          edit-2
          18 hours ago

          Technically, they are predicting the next token. To do that properly they may need to predict the next idea, but thats just a means to an end (the end being the next token).

          • affenlehrer@feddit.org
            link
            fedilink
            English
            arrow-up
            3
            ·
            15 hours ago

            Also, the LLM is just predicting it, it’s not selecting it. Additionally it’s not limited to the role of assistant, if you (mis) configure the inference engine accordingly it will happily predict user tokens or any other token (tool calls etc).