In case you missed it, ChatGPT 5.1 had a tendency to talk about “goblins” in its responses. Supposedly this was a result of training a “nerdy” personality, but it bled into the model as a whole. Because the training run for the latest model already had this flaw, they had to add specific instructions to the system prompt for their Codex coding tool to avoid this behaviour.

Here’s the full prompt from their github. In fact, they repeated the goblin instructions twice, cos you know that will definitely fix it. It’s an interesting read if you consider each one of these instructions were meant to prevent some undesired behaviour: https://paste.sh/Iev3HtMe#JZ4dw_CkvJcpVmjjoy7WZnSn

More info here: https://news.northeastern.edu/2026/05/06/chatgpt-goblins-problem-ai-behavior/

OpenAI’s own blog post casually explaining why they couldn’t predict that their state of the art model would obsess about goblins: https://openai.com/index/where-the-goblins-came-from/

  • theunknownmuncher@lemmy.world
    link
    fedilink
    arrow-up
    52
    ·
    edit-2
    2 days ago

    I still can’t get over how the only fine tuning you can do for an LLM is yell at it with markdown files.

    It isn’t.

    We should be able to retrain local models so they can develop an actual experience without prefilling the context.

    Great news, you can do exactly that.

        • cecilkorik@piefed.ca
          link
          fedilink
          English
          arrow-up
          13
          ·
          1 day ago

          But Microsoft can modify the Windows 11 source code. Or at least they used to be able to, before AI.

          OpenAI should be able to re-train its poorly trained model. But of course it can’t, that would take months, maybe years of datacenter time.

          Now OpenAI since can’t even re-train their own models, they resort to chastising it in its own system prompt.

          This is the problem. If you’re trying to imply this is normal and expected, it shouldn’t be. It needs not to be. We cannot accept this as the normal way of doing things going forward. It is awful, and painfully stupid.

          • theunknownmuncher@lemmy.world
            link
            fedilink
            arrow-up
            3
            ·
            edit-2
            1 day ago

            OpenAI should be able to re-train its poorly trained model. But of course it can’t, that would take months, maybe years of datacenter time.

            Why speak on subjects that you clearly have no knowledge or experience with?

            Training is checkpointed and can be continued without retraining. Finetuning a model that has already been trained is a different process from training, and does not take months or years of datacenter time.

            But Microsoft can modify the Windows 11 source code. Or at least they used to be able to, before AI.

            Huh? It takes way more time and effort to develop new features and changes for software like Windows.

        • Ziglin (it/they)@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          2 days ago

          Windows 11 isn’t running in the cloud yet though. Unless it checks to make sure it hasn’t been tampered with too much you should just be able to modify some of its binaries (the source code obviously isn’t available). With the cloud based llms that is not possible.

          If you have a model on your computer you can retrain it, which is like changing a binary just far less precise. The option of having a source code equivalent just isn’t there beyond having the same dataset and seeds for the training program.

          So I’d say it is worse than your average run of the mill proprietary software.