TIL: There is an open source "Alexa replacement" project

cm0002 · 8 months ago

fonix232@fedia.io · 8 months ago

Aye, I was actually hoping to use the NPU for TTS/STT while keeping the LLM systems GPU-bound.

brucethemoose@lemmy.world · edit-2 8 months ago

It still uses memory bandwidth, unfortunately. There’s no way around that, though NPU TTS would still be neat.

…Also, generally, STT responses can’t be streamed, so you might as well use the iGPU anyway. TTS can be chunked I guess, but do the major implementations do that?

fonix232@fedia.io · 8 months ago

Piper does chunking for TTS, and could utilise the NPU with the right drivers.

And the idea of running them on the NPU is not about memory usage but hardware capacity/parallelism. Although I guess it would have some benefits when I don’t have to constantly load/unload GPU models.
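
For illustration, sentence-level chunking for TTS can be sketched roughly like this. This is not Piper's actual code; `chunk_sentences` and `max_chars` are hypothetical names, and the split rule is a naive assumption (break after `.`, `!` or `?`). The point is only that each chunk can be handed to the synthesiser as soon as it is ready, rather than waiting for the whole text.

```python
import re
from typing import Iterator


def chunk_sentences(text: str, max_chars: int = 200) -> Iterator[str]:
    """Yield chunks of whole sentences, each at most max_chars where possible.

    Hypothetical helper for illustration only; a single sentence longer
    than max_chars is still yielded as one chunk.
    """
    # Naive split: break on whitespace that follows ., ! or ?
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    buf = ""
    for sentence in sentences:
        if not sentence:
            continue
        # Flush the buffer if adding this sentence would exceed the limit
        if buf and len(buf) + 1 + len(sentence) > max_chars:
            yield buf
            buf = sentence
        else:
            buf = f"{buf} {sentence}" if buf else sentence
    if buf:
        yield buf


if __name__ == "__main__":
    for chunk in chunk_sentences("Hello there. How are you? Fine!", max_chars=15):
        # In a real pipeline each chunk would go to the TTS engine here
        print(chunk)
```

With a small `max_chars` each sentence becomes its own chunk, so audio for the first sentence could start playing while later ones are still being synthesised.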

brucethemoose@lemmy.world · 8 months ago

Oh, I forgot!

You should check out Lemonade:

https://github.com/lemonade-sdk/lemonade

It supports Ryzen NPUs via two different runtimes… though apparently not the 8000 series yet?

fonix232@fedia.io · 8 months ago

I’ve actually been eyeing Lemonade, but the lack of Dockerisation is still an issue… guess I’ll just DIY it at some point.

brucethemoose@lemmy.world · edit-2 8 months ago

It’s all C++ now, so it doesn’t really need Docker! I don’t use Docker for any ML stuff, just pip/uv venvs.

You might consider Arch (dockerless) ROCm soon; it looks like 7.1 is in the staging repo right now.
