Has anyone tried in organization to use self hosted llm models for agentic programming?
Im curious if it makes any sense. My organization spends fortune on tokens from us companies. I want to recommend something…
Has anyone tried in organization to use self hosted llm models for agentic programming?
Im curious if it makes any sense. My organization spends fortune on tokens from us companies. I want to recommend something…
As far as I understand, the only way to get anything resembling usable output for coding is with massive, expensive, labouriously hand-tuned models, not local ones.
Qwen 3.6 27B dense is really good. Very usable coding output
^^^ This. Tragically, locally run LLMs don’t even hold a candle to “good” cloud-based LLMs like Claude Code.
I see that qwen 3.5 has pretty good performance and can be run on macbook with 64GB ram
Qwen 3.6 is even better
I have played with qwen3-coder:30b for my hobby stuff running on my M5 max MacBook and it does alright. It is fast enough and I used ollama tools to let it request files. I haven’t used anything like Claude code to compare it to though, only a bit of the ChatGPT free tier stuff.
Deepseek is pretty good the few times I tried it.