• zbyte64@awful.systems
    link
    fedilink
    English
    arrow-up
    1
    ·
    18 hours ago

    How are you able to understand it’s capability without understanding what tools it is capable of manipulating to effect?

    • Communist@lemmy.frozeninferno.xyz
      link
      fedilink
      English
      arrow-up
      1
      ·
      17 hours ago

      You aren’t, and that’s exactly what I’m saying, it’s capable of doing these things with tools, therefore it’s capable of doing these things.

      • zbyte64@awful.systems
        link
        fedilink
        English
        arrow-up
        1
        ·
        13 hours ago

        So why are you allergic to people talking about the quality of the tools in regards to capability?

          • zbyte64@awful.systems
            link
            fedilink
            English
            arrow-up
            1
            ·
            13 hours ago

            You are the one collapsing tool use into a binary when there are varying degrees of competency and hand holding.

            • Communist@lemmy.frozeninferno.xyz
              link
              fedilink
              English
              arrow-up
              1
              ·
              12 hours ago

              I am not, you inaccurately said that the math olympiad was not bested by llm’s because they had a tool that told them if they were close but incorrect and can just try an infinite number of times. This is incorrect, they had a number of tries with python. This just isn’t a true statement. I think them besting it with use of python is equally significant and still counts as them besting it, and saying they can’t do math work is absurd.

              • zbyte64@awful.systems
                link
                fedilink
                English
                arrow-up
                1
                ·
                edit-2
                2 hours ago

                It’s not “bested” by the LLM though, a mathematician used the LLM as a tool to disprove a conjecture. Subtract the mathematicians from the process and the LLM would not have successfully completed the task. It would be more accurate to say a mathematician with an LLM was able to best a mathematician who did not have an LLM. Which is cool, but we don’t need to pretend the LLM is not a tool but something that “understands” math like a mathematician