As Snowden told us, video and audio recording capabilities of your devices are NSA spying vectors. OSS/Linux is a safeguard against such capabilities. The massive datacenter investments in US will be used to classify us all into a patriotic (for Israel)/Oligarchist social credit score, and every mega tech company can increase profits through NSA cooperation, and are legally obligated to cooperate with all government orders.

Speech to text and speech automation are useful tech, though always listening state sponsored terrorists is a non-NSA targeted path for sweeping future social credit classifications of your past life.

Some small LLMs that can be used for speech to text: https://modal.com/blog/open-source-stt

  • fonix232@fedia.io
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    Piper does chunking for TTS, and could utilise the NPU with the right drivers.

    And the idea of running them on the NPU is not about memory usage but hardware capacity/parallelism. Although I guess it would have some benefits when I don’t have to constantly load/unload GPU models.

      • fonix232@fedia.io
        link
        fedilink
        arrow-up
        0
        ·
        2 months ago

        I’ve actually been eyeing lemonade, but the lack of Dockerisation is still an issue… guess I’ll just DIY it at one point.

        • brucethemoose@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          2 months ago

          It’s all C++ now, so it doesn’t really need docker! I don’t use docker for any ML stuff, just pip/uv venvs.

          You might consider Arch (dockerless) ROCM soon; it looks like 7.1 is in the staging repo right now.