The West Forgot How to Build. Now It's Forgetting Code

HaraldvonBlauzahn@feddit.org · 1 day ago


e8d79@discuss.tchncs.de · 20 hours ago

So how would I create such an “Open Source” model? They don’t share the data used to create them do they? Let’s not even get started on how much computing power I would need to train one of those things. These self-hosted models solve nothing except some data privacy issues. Sure, you no longer send all your code to a shady AI company, but you are still 100% dependent on them sharing their models.

The_Decryptor@aussie.zone · edit-2 19 hours ago

So how would I create such an “Open Source” model? They don’t share the data used to create them do they?

No, and going by the OSI definition of “open source AI” they don’t have to, acknowledging that the training material is often copyrighted and can’t be shared.

It’s a strange definition of “open source”, one where you’re not actually allowed to see the source.

ikt@aussie.zone · edit-2 17 hours ago

The model is named Apertus – Latin for “open” – highlighting its distinctive feature: the entire development process, including its architecture, model weights, training data and methods, is openly accessible and fully documented.

https://ethz.ch/en/news-and-events/eth-news/news/2025/09/press-release-apertus-a-fully-open-transparent-multilingual-language-model.html

There is also a move toward synthetic data and human-trained data, so we will have to see where training data goes copyright-wise in the future.

ikt@aussie.zone · 19 hours ago

Do you build your own Linux from scratch? If so why would you assume you can build an LLM from scratch?

qqq@lemmy.world · 18 hours ago

It’s mad easy to build your own Linux from scratch in comparison to building an LLM. You can have your own distro running in like an hour. With buildroot you can have it in even less than that.

ikt@aussie.zone · 18 hours ago

I have no idea what you’re talking about

qqq@lemmy.world · 18 hours ago

… Then why did you use it as an example?

ikt@aussie.zone · 17 hours ago

Because the average person is not building Linux from scratch nor would they know how to

qqq@lemmy.world · 17 hours ago

The average person wouldn’t be building an open source LLM either. I don’t think I follow. I was just saying that your comparison wasn’t going to hit correctly at all due to how easy it actually is to build Linux and a full Linux distribution.

ikt@aussie.zone · edit-2 17 hours ago

The average person wouldn’t be building an open source LLM either

Yeah that’s why I’m saying:

Do you build your own Linux from scratch? If so why would you assume you can build an LLM from scratch?

The OP is basically saying it’s not really open source unless I can personally build it! I don’t think that is a requirement of open source software (your personal ability to compile software does not detract from its open-sourceness).

Tbh I wouldn’t have any idea how to build either; they are way above my skill level. I have no idea how to make a Linux distro, but I’m certain most are open source.

Today, we’re launching Unsloth Studio (Beta): an open-source, no-code web UI for training, running and exporting open models in one unified local interface.

https://unsloth.ai/docs/new/studio

This was only recently released. Maybe in the future we’ll have training material uber-compressed into an open source format that anyone with the skill and knowledge can use, and different ‘distro’ releases of LLMs. We already have tons of smaller models, especially from European universities and others.

The EuroHPC Joint Undertaking (JU) provides access to the computing time and support services offered by the EuroHPC AI Factories. The AI Factories are open to European users from various sectors, including industry, research, academia and public authorities.

https://digital-strategy.ec.europa.eu/en/policies/ai-factories

We are only like 3-4 years into AI going mainstream, if that. Afaik the heat death of the universe is at least 1000 years away, so we have lots of time to work on and improve them. I can only wonder where they will be in 100 years, so I try not to make any damning Facebook-boomer-tier statements about the future.