The defense industry lost the ability to make weapons when crisis hit. The same pattern is eroding software engineering skills. The timelines are identical.
So how would I create such an “Open Source” model? They don’t share the data used to create them, do they?
No, and going by the OSI definition of “open source AI” they don’t have to: the definition acknowledges that the training material is often copyrighted and can’t be shared.
It’s a strange definition of “open source”, one where you’re not actually allowed to see the source.
The model is named Apertus – Latin for “open” – highlighting its distinctive feature: the entire development process, including its architecture, model weights, training data and methods, is openly accessible and fully documented.
https://ethz.ch/en/news-and-events/eth-news/news/2025/09/press-release-apertus-a-fully-open-transparent-multilingual-language-model.html
There is also a move toward synthetic data and human training, so we will have to see where training data goes, copyright-wise, in the future.