DeepSeek is definitely worse than the best ChatGPT and Anthropic models. The gap is especially evident in coding tasks, but in my experience it also hallucinates more and reasons worse in general. For basic language tasks or pointing me to research on a topic, though, it’s pretty good.
I just don’t see it unseating the big American firms without an additional push.
A lot of that has little to do with model capability and comes down to coding harnesses not meeting the expectations of the model. Here’s a great discussion of that: https://xcancel.com/MrAhmadAwais/status/2050956678502420612
The DeepSeek team is aware of the tooling gap and is now working on its own harness to close it: https://deepseekv4pro.com/news/deepseek-code-harness-team-claude-code-rival-report
Whoa, cool. Thanks for the info.