Their reputation is also a bit in the toilet, because people hear “AI” and think of ChatGPT.
So “man hospitalised after AI suggested he put glue on pizza for tackiness” would have people think he was using it, when he might well have been using a different LLM.
And, as I understand it, Anthropic hasn’t committed as much spending to building out new data centers, and has setup their operations to be GPU agnostic, so they can keep flexibility between NVIDIA GPUs, Google TPUs, and Amazon Trainium, and play the data center pricing game. Anthropic is better positioned to survive an AI winter (and I believe it’s coming soon).
Distillation isn’t stealing the original model, though. It just uses the models to make synthetic training data to train their own thing. They aren’t stealing the model itself.
It also doesn’t seem like as big a deal as Anthropic and Open AI make it look, IMO. Them treating it like a national security issue where the company gets its models stolen from under its nose just comes across like a media company claiming that every download is a copy they would otherwise have sold at full price, and thus they have accrued trillions of dollars in damages.
I could, in theory, take a bunch of google Gemini outputs, and train a GPT-2 model on them. That doesn’t mean that I’ve recreated Gemini, nor does it mean that i’ve stolen it from Google, either.
To top it all off, it’s not like their services were abused. The companies were presumably paid appropriately for the usage.
The field is moving so fast that things can change quickly, but the American labs are so caught up in saddling their models with safety overhead that the recent Chinese models are very close in practical use to the flagship American models if not pulling ahead (Sora vs Seedance 2).
I don’t really need to solve Erdős problems in my day to day. Outside of increasingly edge case eval competition, I’m not sure what OpenAI brings that literally everyone else isn’t also capable of providing (and more).
I’d maybe invest in Anthropic for an IPO if they turned around their own saddling of models and played nicer with open platforms, but if Claude is just going to get more and more anxious due to excessive red teaming and CC fall further and further behind stuff like Hermes Agent, they too are going to fall by the wayside as open models become the dominant inference for open infrastructure.
China won the game. Their models are cheap and available and free weights.
Openai will never make any money. They realized it’s a high time to sell so. Wouldnt give a dime
They’re also behind Anthropic when it comes to expensive frontier models.
I wouldn’t buy their stock even if I was looking to invest in AI.
Their reputation is also a bit in the toilet, because people hear “AI” and think of ChatGPT.
So “man hospitalised after AI suggested he put glue on pizza for tackiness” would have people think he was using it, when he might well have been using a different LLM.
And, as I understand it, Anthropic hasn’t committed as much spending to building out new data centers, and has setup their operations to be GPU agnostic, so they can keep flexibility between NVIDIA GPUs, Google TPUs, and Amazon Trainium, and play the data center pricing game. Anthropic is better positioned to survive an AI winter (and I believe it’s coming soon).
Their models may also be based on US models.
It’ll be hard for derivative models to innovate if their host organism has died.
Distillation isn’t stealing the original model, though. It just uses the models to make synthetic training data to train their own thing. They aren’t stealing the model itself.
Plus, a lot of companies do it. Anthropic’s Claude was calling itself DeepSeek for a while.
It also doesn’t seem like as big a deal as Anthropic and Open AI make it look, IMO. Them treating it like a national security issue where the company gets its models stolen from under its nose just comes across like a media company claiming that every download is a copy they would otherwise have sold at full price, and thus they have accrued trillions of dollars in damages.
I could, in theory, take a bunch of google Gemini outputs, and train a GPT-2 model on them. That doesn’t mean that I’ve recreated Gemini, nor does it mean that i’ve stolen it from Google, either.
To top it all off, it’s not like their services were abused. The companies were presumably paid appropriately for the usage.
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
Removed by mod
It’s true.
The field is moving so fast that things can change quickly, but the American labs are so caught up in saddling their models with safety overhead that the recent Chinese models are very close in practical use to the flagship American models if not pulling ahead (Sora vs Seedance 2).
I don’t really need to solve Erdős problems in my day to day. Outside of increasingly edge case eval competition, I’m not sure what OpenAI brings that literally everyone else isn’t also capable of providing (and more).
I’d maybe invest in Anthropic for an IPO if they turned around their own saddling of models and played nicer with open platforms, but if Claude is just going to get more and more anxious due to excessive red teaming and CC fall further and further behind stuff like Hermes Agent, they too are going to fall by the wayside as open models become the dominant inference for open infrastructure.