SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 22 hours agoDo you host your own AI?message-squaremessage-square173fedilinkarrow-up1141file-text
arrow-up1141message-squareDo you host your own AI?SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 22 hours agomessage-square173fedilinkfile-text
minus-squareSuspiciousCarrot78@aussie.zoneOPlinkfedilinkEnglisharrow-up9·11 hours agoLlama.cpp or death!
minus-squaretristynalxander@mander.xyzlinkfedilinkEnglisharrow-up1·4 hours agoIt’s not that hard to use llama.cpp directly anyway. Why would I use a wrapper when I can just run a python script?
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up1·edit-25 hours agoOr exllama! Vllm, sglang, Lorax. Koboldcpp, Aphrodite, text-generation-webui, LM Studio, powerinfer, ktransformers, mlc-LLM, really whatever floats your boat. Just not ollama, specifically.
Llama.cpp or death!
It’s not that hard to use
llama.cppdirectly anyway. Why would I use a wrapper when I can just run a python script?Or exllama! Vllm, sglang, Lorax. Koboldcpp, Aphrodite, text-generation-webui, LM Studio, powerinfer, ktransformers, mlc-LLM, really whatever floats your boat. Just not ollama, specifically.