• SlimePirate@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 hours ago

    The fact that it uses a non-trivial neural network. If it was simply a rate count of based on a corpus of how much time each word is followed by each it wouldn’t be stronger than keyboard word predictions. To make accurate suggestions requires emergence of primitive reasoning on the semantics of the tokens, LLM neural networks (transformers) can be analyzed to find subnetworks dedicated to modeling reality. It is still probability, but saying it’s just probability is not faithful

    • hesh@quokk.au
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 hours ago

      It’s still just predicting the next token, it’s just using more past data points than your keyboard. The rest of the phenomena are emergent from that. I think it’s important to keep that in mind given how much they can imitate human reasoning.