Local LLM agents

Kkk2237pl@lemmy.world · 2 days ago


SeductiveTortoise@piefed.social · edit-2 2 days ago

Apple unified memory is shared across the CPU, GPU and NPU, so you can assign a lot of memory to running local models, and the bandwidth is good, depending on the model.

AMD has something similar with their something something AI CPUs and they go up to 128GB at the moment. Apple can be way faster though. And you were able to buy a Mac Studio with 512GB back when RAM wasn’t worth more than unicorn pee. For… I guess 10k though.

87Six@lemmy.zip · 1 day ago

Apple unified memory shares

That’s cool asf.

Apple engineers with better leadership could change the fucking world… But instead they’re used to screw over their own user base.

If my GPU starts falling back to RAM my game fps drops to 1 lol.

PeeOnYou [he/him]@lemmygrad.ml · 13 hours ago

It's shared, sure, but the bandwidth is crap compared to a dedicated Nvidia card. Performance will suffer, even though it allows you to run larger quants.
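To put rough numbers on that tradeoff, here is a back-of-the-envelope sketch. It assumes token generation is limited by memory bandwidth and that every weight is read once per generated token, so it ignores the KV cache, compute limits and mixture-of-experts models. The function names and the bandwidth figures are made-up placeholders, not specs or benchmarks of any real hardware.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def max_tokens_per_s(bandwidth_gb_s: float, model_gb: float) -> float:
    """Rough ceiling on decode speed if all weights are read once per token."""
    return bandwidth_gb_s / model_gb


# Hypothetical example: a 70B-parameter dense model at 4 bits per weight.
size = weights_gb(70, 4)
print(f"weights: {size:.1f} GB")

# Placeholder bandwidth figures in GB/s, not measurements of any device.
for name, bandwidth in [("unified memory", 400.0), ("dedicated GPU", 1000.0)]:
    print(f"{name}: at most {max_tokens_per_s(bandwidth, size):.1f} tokens/s")
```

Under these assumptions the faster memory wins on tokens per second, but only if the weights fit in it at all. That is where a big shared pool comes in: it can hold a model that would not fit in a smaller dedicated VRAM pool.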

87Six@lemmy.zip · 3 hours ago

Oh…