Jambi’s mission is to transcribe audio to your clipboard, as quickly and accurately as possible, while staying privacy-focused and open-source.
Jambi aims to help computer users with disabilities, such as vision or physical impairments, by providing real-time transcription of their speech. It’s also a great tool for anyone who wants to transcribe audio quickly and easily.
This is the alpha release and the project is still in early development. Currently looking for feedback and contributors. If you are a developer, you can contribute to the project by submitting pull requests or reporting issues.
If you like the project, please show your support by leaving a star. Thanks! https://github.com/guttermonk/jambi
I’m not a developer but I see this being immensely useful for a lot of people. Good luck to you! :)
That means a lot to me. Thanks so much for your kind words!
Is there any reason you chose vosk over whisper.c++?
I’m not familiar with whisper.c++ but I did try faster-whisper. Unfortunately, the transcriptions took upward of 40sec and it didn’t offer live transcription, which is a nice feature of vosk. There’s a comparison in the readme with other differences. That said, it should be relatively modular. It shouldn’t take much to swap it back to whisper if that’s what you prefer to use. Whisper is in the nix flake as optional, and the program allows you to change models but i haven’t bothered trying to switch back to Whisper since Vosk has been more performant.
I should also add that if you don’t want to reconfigure Jambi to use Whisper, you can try WhisperNow, which is built in Python and uses Whisper. I saw similar transcription performance when I used Whisper with Jambi and decided to move forward with Vosk after testing Whisper in both programs.
Probably just playing it safe since vosk has been around longer iirc Is kinda weird to rust and not choose the more performant whisper tho
Looks cool, I’ll definately try it out.
Any plans for an Android APK?
I use the Futo Keyboard on Android which already has this feature. It’s also opens source, and I’ve had a really good experience with it so far.
Nice, I’ll check it out
I believe futo keyboard might be more “source available” than open source, but it’s still a great app! (I’m typing with it now and it’s my favorite more open keyboard app I’ve used :)
If you want to use another keyboard app like the fantastic FOSS Heliboard, you can also just use the futo voice input app as a voice input provider :)
Futo apps are available through their own f-droid repo if you wanna install/update through an f-droid client- I figured I’d mention that just in case you go searching for it and don’t see it since it’s not in one of the common repos