• kell_t@programming.dev
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 hours ago

    Thinking/reasoning tokens kind of approximate that actually, which is what most flagships and even my own local LLM use.

    Thinking tokens are quite like normal generative tokens, except that the LLM is ‘talking’ to itself. You can see its thoughts (depending on what settings you’ve put/IDE you use), but they aren’t meant to be the actual response to your prompt. They are what the AI is designed to draft their answer before committing, to explore different options and to ‘reason’ itself into a more refined response.

    Reasoning tokens is how AI can actually do math now, rather than just guess a number and pray, by the way.