Ladies and gentleman, we have reached peak Agentic AI Coding - Goblin instructions in OpenAI's Codex system prompt

brianpeiris@lemmy.ca · edit-2 23 days ago

Ladies and gentleman, we have reached peak Agentic AI Coding - Goblin instructions in OpenAI's Codex system prompt

sudo@programming.dev · 23 days ago

I still can’t get over how the only fine tuning you can do for an LLM is yell at it with markdown files. We should be able to retrain local models so they can develop an actual experience without prefilling the context.

theunknownmuncher@lemmy.world · edit-2 23 days ago

I still can’t get over how the only fine tuning you can do for an LLM is yell at it with markdown files.

It isn’t.

We should be able to retrain local models so they can develop an actual experience without prefilling the context.

Great news, you can do exactly that.

jdr@lemmy.ml · 23 days ago

Not GPT5.1 though lol

theunknownmuncher@lemmy.world · edit-2 18 days ago

We should be able to retrain local models

local models

local

Is GPT5.1 a local model?

cecilkorik@piefed.ca · 23 days ago

But Microsoft can modify the Windows 11 source code. Or at least they used to be able to, before AI.

OpenAI should be able to re-train its poorly trained model. But of course it can’t, that would take months, maybe years of datacenter time.

Now OpenAI since can’t even re-train their own models, they resort to chastising it in its own system prompt.

This is the problem. If you’re trying to imply this is normal and expected, it shouldn’t be. It needs not to be. We cannot accept this as the normal way of doing things going forward. It is awful, and painfully stupid.

theunknownmuncher@lemmy.world · edit-2 22 days ago

OpenAI should be able to re-train its poorly trained model. But of course it can’t, that would take months, maybe years of datacenter time.

Why speak on subjects that you clearly have no knowledge or experience with?

Training is checkpointed and can be continued without retraining. Finetuning a model that has already been trained is a different process from training, and does not take months or years of datacenter time.

But Microsoft can modify the Windows 11 source code. Or at least they used to be able to, before AI.

Huh? It takes way more time and effort to develop new features and changes for software like Windows.

kurwa@lemmy.world · 23 days ago

Not with that attitude!

Ziglin (it/they)@lemmy.world · 23 days ago

Windows 11 isn’t running in the cloud yet though. Unless it checks to make sure it hasn’t been tampered with too much you should just be able to modify some of its binaries (the source code obviously isn’t available). With the cloud based llms that is not possible.

If you have a model on your computer you can retrain it, which is like changing a binary just far less precise. The option of having a source code equivalent just isn’t there beyond having the same dataset and seeds for the training program.

So I’d say it is worse than your average run of the mill proprietary software.

RamenJunkie@midwest.social · 23 days ago

How many extra tokens get burned with all this pre filled context I wonder.

Junkasaurus@lemmy.world · edit-2 22 days ago

deleted by creator

Bazoogle@lemmy.world · 22 days ago

Nope, it does the same thing:

Pi’s minimal system prompt and extensibility let you do actual context engineering. Control what goes into the context window and how it’s managed.

AGENTS.md: Project instructions loaded at startup from ~/.pi/agent/, parent directories, and the current directory.

SYSTEM.md: Replace or append to the default system prompt per-project.

Junkasaurus@lemmy.world · 22 days ago

deleted by creator

corbindallas@fedinsfw.app · 23 days ago

You can. Just not frontier models. Check out unsloth

sudo@programming.dev · 22 days ago

I’ve been using gguf models from unsloth but I haven’t seen anything from them on retraining. Especially with consumer hardware.

Eager Eagle@lemmy.world · 23 days ago

lol how do you think LLMs are trained in the first place?

thingsiplay@lemmy.ml · 23 days ago

I think he (or she) is talking about the user of the LLM, not the creator.

Eager Eagle@lemmy.world · edit-2 23 days ago

but you can, as long as it’s open weight. Fine tuning and training are pretty much the same process

thingsiplay@lemmy.ml · 23 days ago

That still falls into the category “creator” to me, if you need to rebuild. I was making the distinction to an end user, comparable to applications that you download and use and configure. Instead of rebuilding the source code with your modifications.

Do I misunderstand here something? Or is this a communication issue caused by different interpretations?

howrar@lemmy.ca · 22 days ago

If you define “user” to be a set that excludes anyone capable of modifying the weights, then by definition, no user can modify the weights.

Any criticism about users being unable to modify weights becomes vacuous, so it’s not an interpretation that makes sense.

thingsiplay@lemmy.ml · 22 days ago

I wasn’t criticizing at all. Just tried to define what I mean by creator and user. You was takling about “how do you think LLMs are trained” and I told you that the user was probably not thinking of who trains the LLMs, or fine tune them as you said. And yes, fine tuning the open weight falls into creation process, as they are rebuild. That is not the same as an end user who downloads the final usable product. And yes, it makes sense.

Eager Eagle@lemmy.world · 21 days ago

the original comment says “We should be able to retrain local models so they can develop an actual experience without prefilling the context.” - it turns out we can. Not sure why you’re trying to attach labels of user vs creator, when the premise already mentions retraining.