• Kay Ohtie@pawb.social
        link
        fedilink
        English
        arrow-up
        5
        ·
        23 hours ago

        All of these features are not something the models themselves can do, but are grafted on.

        I could easily write a Home Assistant automation pattern matching for nearly every way someone could say “how many Rs are in strawberry”, depluralize a plural letter, and run it against “wc” in a bash terminal.

        That doesn’t mean it’s smarter. It’s that I’ve added something specific to it.

        MCP and the like is just that too, gluing on functions or the ability to hopefully invoke a function. That’s why so many hilariously mundane ones exist.

        At the core, it’s still a large language model: a statistical model of frequency of word and word chunk (token) patterns.

        Sometimes one model can invoke another via that tooling but it’s still a grafting on. It isn’t a singular thing or system, but disjointed pieces so completely detached from how brains work.

        This isn’t AI hate, it’s reality. I love the field of artificial intelligence and machine learning. It’s cool as hell. But an LLM is fundamentally incapable of being anything more than an LLM with glued on pieces that invoke functionality.

        OpenAI saw people mock the inability to count so they wrote a specialized tool to count letters and glued it on.

        The world is full of endless edge cases. The inability to simply resolve them without gluing on every single one means it just isn’t doing anything new.

        • MangoCats@feddit.it
          link
          fedilink
          English
          arrow-up
          3
          ·
          18 hours ago

          I believe the progress of the last year is largely attributable to the appropriate “grafting on” of these wrappers around the LLM cores.

        • Communist@lemmy.frozeninferno.xyz
          link
          fedilink
          English
          arrow-up
          2
          ·
          22 hours ago

          They regularly win olympiad mathematics up from not standing a chance and just created a novel solution to the erdos conjecture, them counting the r’s in strawberry is inconsequential but also something they can do even if you just use the raw api or a local model.

          • zbyte64@awful.systems
            link
            fedilink
            English
            arrow-up
            3
            ·
            19 hours ago

            Using computers to search for a counter example to a conjecture isn’t exactly new ground and I suspect they did so with the aide of some harness tweaks like some numerical LSP. Like cool, it pushed the envelope but like what the parent said, they grafted on the ability to do a specific task.

              • zbyte64@awful.systems
                link
                fedilink
                English
                arrow-up
                0
                ·
                edit-2
                4 hours ago

                Aren’t you the least bit curious what tools they gave the LLM and how the LLM used those tools? It’s like back in math class you are asked to solve a quadratic formula but you forgot how. So you use the calculator to try different numbers and the calculator is telling you if you are getting closer. Sure I got the right answer, but it’s hardly a testament to my math skills.

      • criss_cross@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 day ago

        A lot of tools like Claude or ChatGPT have internal tools they call when they do math (or use a python script) rather than have the model actually compute anything.

        The underlying tech itself can’t do it because you can’t do math by token probability.

        • SpaceDuck@feddit.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 hours ago

          Is that relevant? Mathematicians will use tools and computers that calculate for them too. Are we saying they should all do it in their heads?