I’m a big proponent of self-hosting, right to repair, and rolling your own whatever when you can. That probably started as teenage rebellion that got baked in - I was lucky enough to read both Walden and The Hobbit during a week-long cyclone lockdown several decades ago - but I suspect there’s a non-trivial overlap between that space and privacy-minded people in general.

My endgame is a self-sufficient intranet for myself and family: if the net goes down tomorrow, we’d barely notice.

I also use LLMs as a tool. True self-hosted equivalence to state-of-the-art models is still an expensive proposition, so like many, I use cloud-based tools like Claude or Codex for domain-specific heavy lifting - mostly coding. Not apologising for it; I think it’s a reasonable trade-off while local hardware catches up.

That context is just to establish where I’m coming from when I say this caught my attention today:

https://support.claude.com/en/articles/14328960-identity-verification-on-claude

To be accurate about what it actually says: this isn’t a blanket “show us your passport to use Claude.” Not yet.

The policy as written is narrower than it might first appear.

My concern isn’t what it says - it’s that the precedent now exists. OpenAI will no doubt follow suit.

Scope creep is a documented pattern with this kind of thing, and “we only use it for X” describes current intent, not a structural constraint.

Given the nature of this community, figured it was worth flagging.

  • steel_for_humans@piefed.social · 11 hours ago

    Say I have a GPU with 32GB VRAM and I am on Linux, what local LLM would be good for coding?

    Currently I just have an iGPU ;) but that’s always an option, albeit a very expensive one.

    • andrew0@lemmy.dbzer0.com · 10 hours ago

      Get llama.cpp and try Qwen3.6-35B-A3B. It just came out and looks good. You’ll have to look into optimal settings, as it’s a Mixture of Experts (MoE) model with only ~3B parameters active per token - the active weights fit in VRAM while the inactive experts can stay in system RAM without tanking inference speed.

      You could also try the dense model (Qwen3.5-27B), but that will be significantly slower. Put either of these in a coding harness like Oh-My-Pi, OpenCode, etc. and see how it fares on your tasks. It should be OK for small tasks, but don’t expect Opus / Sonnet 4.6 quality - more like somewhat better than Haiku.
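For reference, a sketch of serving a MoE GGUF with llama.cpp while keeping the experts in system RAM. The model filename and the tensor-offload regex below are illustrative assumptions, not exact values - check `llama-server --help` and the actual tensor names for your quant:

```shell
# Sketch only: offload everything to the GPU except the expert FFN
# tensors, which stay in system RAM (the usual MoE split).
# The .gguf filename and the -ot pattern are placeholder assumptions.
llama-server \
  -m ./qwen-moe-a3b-Q4_K_M.gguf \
  --n-gpu-layers 99 \
  --override-tensor ".ffn_.*_exps.=CPU" \
  --ctx-size 16384 \
  --port 8080
```

A coding harness can then be pointed at http://localhost:8080/v1 as an OpenAI-compatible endpoint.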

    • SuspciousCarrot78@lemmy.world (OP) · 10 hours ago (edited)

      Sadly…none. Well, I mean…it depends what you mean by “coding”. If you mean “replace Claude with a local model”, then…none. Sorry.

      If you mean “actually, if I use ECA to call a cloud model from OpenRouter for planning, then have it direct a local LLM to do the scut work”, then the Qwen series of models (like Qwen 3 Next) is pretty awesome.
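A minimal sketch of that split, assuming both sides expose an OpenAI-compatible endpoint (the URLs and model names are placeholders, not recommendations):

```python
# Sketch: route planning requests to a cloud model via OpenRouter
# and everything else to a local llama.cpp server.
# Both endpoints speak the OpenAI-compatible chat API, so the same
# client code works against either. Model names are hypothetical.

PLANNER = {"base_url": "https://openrouter.ai/api/v1",
           "model": "cloud-planner-model"}        # placeholder name
WORKER = {"base_url": "http://localhost:8080/v1",  # local llama.cpp
          "model": "local-qwen"}                   # placeholder name


def route(task_kind: str) -> dict:
    """Send high-level planning to the cloud; do the scut work locally."""
    return PLANNER if task_kind == "plan" else WORKER
```

The point is only that the planner and the worker are just two base URLs to the harness; which model sits behind each is your call.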

      The iGPU will make you want to kill yourself though. Get a GPU :) Even a 4-16GB one can make a difference.

      PS: You said GPU and iGPU, so I’m not sure which one has the 32GB or what rig you’re running. I have a suspicion, though, that you’re on an i5 or i7 with something like an Intel UHD 630 iGPU built in? In which case the iGPU is pretty slow, and you won’t be able to use CUDA at all (that’s NVIDIA-only); depending on the exact chip, Vulkan acceleration won’t buy you much either.

      So, the “get a GPU” thing still holds :)

        • steel_for_humans@piefed.social · 10 hours ago

        I meant that I can buy one of those Radeons dedicated to AI work, like the ASRock Radeon AI PRO R9700 Creator 32GB GDDR6. If I need to.

        Currently my Ryzen iGPU is all I need, because all I need is to see the graphical desktop environment on my screen ;) It does the job well.

        I use Claude Code as well, and I’m slightly concerned by that ID verification news - even more so because of the technology partner they chose.