LibreTechni.ca
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
sanitation@lemmy.today to Technology@lemmy.worldEnglish · 2 days ago

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

www.psypost.org

external-link
message-square
82
fedilink
248
external-link

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

www.psypost.org

sanitation@lemmy.today to Technology@lemmy.worldEnglish · 2 days ago
message-square
82
fedilink
Just a moment...
www.psypost.org
external-link
  • Communist@lemmy.frozeninferno.xyz
    link
    fedilink
    English
    arrow-up
    1
    ·
    22 hours ago

    No.

    https://www.nature.com/articles/d41586-025-02343-x

    It’s lying

    • zbyte64@awful.systems
      link
      fedilink
      English
      arrow-up
      1
      ·
      19 hours ago

      You know the “DeepMind and OpenAi models” is the hint that the LLM model is not the one doing the math. The LLM provides a hypothesis and the DeepMind model provides grounding or feedback on whether the hypothesis even makes sense or works.

      • Communist@lemmy.frozeninferno.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 hours ago

        It is totally irrelevant that the model calls tools to do the math. That is still a success.

        • zbyte64@awful.systems
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          2 hours ago

          It’s relevant to what the parent was saying about LLMs. The success of the LLM in using mathematical tools does not contradict what they were saying. To then accuse them of lying because of a misunderstanding is… bad form.

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @L4s@lemmy.world
  • @autotldr@lemmings.world
  • @PipedLinkBot@feddit.rocks
  • @wikibot@lemmy.world
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 3.87K users / day
  • 8.88K users / week
  • 15.8K users / month
  • 30.5K users / 6 months
  • 1 local subscriber
  • 85.7K subscribers
  • 5.93K Posts
  • 185K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • L4sBot@lemmy.world
  • Technopagan@lemmy.world
  • L3s@hackingne.ws
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org