As AI gets more expensive, popularity of Chinese AI models like DeepSeek is growing fast, even though these models are not as good as ChatGPT, Claude and Gemini. Industry experts believe the cost pressure is real and most companies will move to a hybrid model for AI use.
Nvidia sees the writing in the wall too, hence the big Nemotron effort now. They’ve been pushing open models, but no one can hear them over Altman’s lies.
AMD… is… trying.
Some other companies have made pretty interesting efforts too, like LG and IBM. Huawei already publish a big model to promote their ASICs, and is planning another in weeks. Even some Russian company trained a big open LLM from scratch, though it wasn’t very good TBH.
And this is not even looking outside the LLM space, where all sorts of interesting models are published.
I’m pressing X to doubt.
Yeah, NVIDIA knows they have to pivot to the consumer market soon. Apple seems to be going in that direction as well.
Oh don’t mistake me, they are not consumer friendly.
They are just trying to sell enterprise GPUs directly to “consumer” businesses and the cloud providers they use, instead of through literally fraudulent middlemen like OpenAI.
This is what pretty much everyone with hardware is doing, including Huawei, Tenstorrent, Cerebras, even AMD. Maybe I misinterpreted you, but hardly anyone cares about individual self-hosters.
Apple does, though. MLX is actually getting pretty cool. But they’ll always be quite insular, anti-consumer in other ways, and they still seem detached from what the community is largely doing.
My view is that we’re basically in the mainframe era of AI, but local models are already getting good enough to do useful stuff. Qwen 3.6 in particular is very capable, and you can do real work with it. So, extrapolate this into a couple of years into the future and it’s almost certain that we’ll be able to run models that perform as well as current frontier models locally. And that means companies are going to be much more likely to self host as well. In fact, I think you’re completely right that the immediate target will be business customers that want to self host their own models before this tech really gets to consumer grade.
Yeah. I mean, I have a Ryzen desktop and a 2020 GPU, and Mimo 2.5 is a bit faster and mind bogglingly better than frontier models from like… two years ago? And frontier models are plateauing, I think.
Still, my worry is that we consumer won’t HAVE any hardware. Many don’t even own a laptop these days, and it feels like they’ll just drop desktops (and work will just use thin clients) if they’re too cost prohibitive for people to buy.