Jambi’s mission is to transcribe audio to your clipboard, as quickly and accurately as possible, while staying privacy-focused and open-source.
Jambi aims to help computer users with disabilities, such as vision or physical impairments, by providing real-time transcription of their speech. It’s also a great tool for anyone who wants to transcribe audio quickly and easily.
This is the alpha release, and the project is still in early development. I’m currently looking for feedback and contributors. If you are a developer, you can contribute by submitting pull requests or reporting issues.
If you like the project, please show your support by leaving a star. Thanks! https://github.com/guttermonk/jambi
I’m not familiar with whisper.cpp, but I did try faster-whisper. Unfortunately, transcriptions took upward of 40 seconds, and it didn’t offer live transcription, which is a nice feature of Vosk. The README has a comparison that covers other differences. That said, the code should be relatively modular, so it shouldn’t take much to swap it back to Whisper if that’s what you prefer to use. Whisper is included in the Nix flake as optional, and the program lets you change models, but I haven’t bothered trying to switch back to Whisper since Vosk has been more performant.
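To illustrate what "modular" could look like in practice, here is a rough sketch of a swappable engine layer. This is not Jambi's actual code: the `Transcriber` interface, the `ENGINES` registry, and both engine classes are hypothetical stand-ins that only mimic the difference between a live engine (text per audio chunk, as with Vosk) and a batch engine (text once at the end, as with Whisper). No real speech library is called.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Protocol


class Transcriber(Protocol):
    """Hypothetical interface: any engine that turns audio chunks into text."""

    def stream(self, chunks: Iterable[bytes]) -> Iterator[str]:
        ...


class StreamingEngine:
    """Stand-in for a live engine: yields a result for every chunk."""

    def stream(self, chunks: Iterable[bytes]) -> Iterator[str]:
        for chunk in chunks:
            yield f"<partial: {len(chunk)} bytes>"


class BatchEngine:
    """Stand-in for a batch engine: yields one result after all audio is in."""

    def stream(self, chunks: Iterable[bytes]) -> Iterator[str]:
        total = sum(len(chunk) for chunk in chunks)
        yield f"<final: {total} bytes>"


# Hypothetical registry: swapping engines means changing one key.
ENGINES: Dict[str, Callable[[], Transcriber]] = {
    "vosk": StreamingEngine,
    "whisper": BatchEngine,
}


def transcribe(engine_name: str, chunks: Iterable[bytes]) -> List[str]:
    """Run the selected engine and collect everything it emits."""
    engine = ENGINES[engine_name]()
    return list(engine.stream(chunks))
```

With a layout like this, the rest of the program only ever calls `transcribe`, so switching from one engine to the other would come down to a config value rather than a code change.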