Unless you are retraining the model locally at your 23-acre data center in your garage after every interaction, it’s still not learning anything. You are just dumping more data into its temporary context.
Sounds like you have no clue what an LLM/AI actually is or is capable of.
https://medium.com/sciforce/step-by-step-guide-to-your-own-large-language-model-2b3fed6422d0
It’s not hard to keep a data library updated for context, and some are under a TB in size.
Where are you getting your information from?
It seems you are still confusing context with training? Did you read that text and understand it?
Did you follow it yourself to build an llm?
I bet they had an LLM read it and summarize it for them
Why do you think it’s solely a training issue?
So, you did not? Ok
Can’t answer the question eh?
What a shocker.
If you can’t explain or justify your side, I’ve got no time for people like you.
What part of customize did you not understand?
And lots fit on personal computers dude, do you even know what different llms there are…?
One for programming doesn’t need all the fluff of books and art, so now it’s a manageable size. Llms are customizable to any degree, use your own data library for the context data even!
What part about how LLMs actually work do you not understand?
“Customizing” is just dumping more data into its context. You can’t actually change the root behavior of an LLM without rebuilding its model.
If it’s constantly making an error, fix the context data dude. What about an LLM/AI makes you think this isn’t possible…? Lmfao, you just want to bitch about AI, not comprehend how they work.
This is Lemmy, bitching about AI is the norm.
Yes, which would fix the incorrect coding issues. It’s not an llm issue, it’s too much data. Or remove the context causing that issue. These require a little legwork and knowledge to make useful. Like anything else.
You really don’t know how these work do you?
You do understand that the model weights and the context are not the same thing right? They operate completely differently and have different purposes.
Trying to change the model’s behavior using instructions in the context is going to fail. That’s like trying to change how a word processor works by typing into the document. Sure, you can kind of get the formatting you want if you manhandle the data, but you haven’t changed how the application works.
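To make the weights-vs-context split concrete, here is a toy sketch. It is nowhere near a real LLM: the "weights" are just bigram counts, and `train`, `generate`, and `CORPUS` are made-up names for illustration. The only point is that generation reads the weights and extends a per-call context, and nothing typed into the context gets written back.

```python
import copy
from collections import Counter, defaultdict

# Hypothetical toy corpus; a real model is trained on vastly more data.
CORPUS = ["the cat sat on the mat", "the dog sat on the rug"]

def train(corpus):
    """'Training': build bigram counts. These stand in for model weights."""
    weights = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.split()
        for prev, nxt in zip(tokens, tokens[1:]):
            weights[prev][nxt] += 1
    return weights

def generate(weights, context, max_new=3):
    """'Inference': extend the context using the weights, read-only."""
    tokens = context.split()  # the context lives only for this call
    if not tokens:
        return ""
    for _ in range(max_new):
        options = weights.get(tokens[-1])  # .get() never adds new entries
        if not options:
            break  # the weights know nothing about this token
        tokens.append(options.most_common(1)[0][0])
    return " ".join(tokens)

WEIGHTS = train(CORPUS)
BEFORE = copy.deepcopy(WEIGHTS)

first = generate(WEIGHTS, "my parrot sat")  # "parrot" appears in the context
second = generate(WEIGHTS, "parrot")        # a new call has no memory of it

WEIGHTS_UNCHANGED = (WEIGHTS == BEFORE)
print(first)
print(second)
print(WEIGHTS_UNCHANGED)
```

The second call returns the prompt untouched because "parrot" was only ever in a context, never in the training data, and the weights compare equal before and after both calls.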
Why are you so focused on just the training? The data is ALSO the issue.
Of course, if you ignore the one fix that works, all you can do is cry that it’s not fixable.
But it is.
Because I work with LLMs daily. I understand how they work. No matter how much I type at an LLM, its behavior will never fundamentally change without regenerating the model. It never learns anything from the content of the context.
The model is the LLM. The context is the document of a word processor.
A Jr developer will actually learn and grow into a Sr developer and will retain that knowledge as they move from job to job. That is fundamentally different from how an LLM works.
I’m not anti-AI. I’m not “crying” about their issues. I’m just discussing them from a practical standpoint.
LLMs do not learn.
Clearly you don’t, because context data modifies how the training data extrapolates.
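For what it's worth, the narrow claim that the same fixed model produces different output depending on what is in the context can also be sketched with a toy. This is an assumption-heavy illustration, not how a transformer works: the "weights" are word co-occurrence counts, and `train`, `predict_next`, and `CORPUS` are hypothetical names.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical two-sentence corpus where "python" is ambiguous.
CORPUS = ["zoo python snake", "code python script"]

def train(corpus):
    """Count, for each word, which words appear later in the same sentence."""
    weights = defaultdict(Counter)
    for sentence in corpus:
        for earlier, later in combinations(sentence.split(), 2):
            weights[earlier][later] += 1
    return weights

def predict_next(weights, context):
    """Score candidates by summing counts over every token in the context."""
    scores = Counter()
    for token in context.split():
        scores.update(weights.get(token, {}))
    return scores.most_common(1)[0][0] if scores else None

WEIGHTS = train(CORPUS)  # trained once, never touched again

zoo_guess = predict_next(WEIGHTS, "zoo python")
code_guess = predict_next(WEIGHTS, "code python")
print(zoo_guess)   # snake
print(code_guess)  # script
```

Same weights, different context, different prediction. Whether that counts as "learning" is the actual disagreement in this thread; the sketch only shows that the context steers the output without the weights changing.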
You can use something, while not being educated on how to use it. And just using something does not mean you understand how they work. Your comments have made it QUITE clear that you have no idea.
People who just whinge about AI and pretend they know how they work are the worst kind of people right now.
Odd, I can say the exact same thing about your comments on the subject.
We are clearly at an impasse that won’t be solved through this discussion.
But
Is not inside the context, that comes from training. So you know how an llm works?
Where do you think the errors are coming from? From data bleed over, the word “coding” shows up in books, so yes the context would incorrectly pull book data too.
Or do you not realize coding books exist as well…? And would be in the dataset.
Why would you put whole books into the context?!? Do you even know what an llm is?
Because that’s how they work…? It’s not an actual physical book… you don’t seriously think this do you…? it’s the text data inside, like any other text file it would use for context.
Where do you think it gets its data from…?
From the training.
I will stop replying now, because you clearly need to learn more about LLMs.
Here, have a fish 🐟