Current SOTA in local FOSS speech to text?

solrize@lemmy.ml · 3 days ago

Current SOTA in local FOSS speech to text?

skarn@discuss.tchncs.de · 3 days ago

On my potato-powered laptop (a mid-range ThinkPad from 2018) it does not run in real time on the CPU, particularly if you want to use a decent model, which I need because of my foreign accent.

I would say the quality generally beats YouTube's transcription, even with the worst model.

solrize@lemmy.ml · 3 days ago

Thanks. My old i5-something server is probably in the same speed range as your laptop. It’s good to hear about the transcription quality. If conversion is slower than real time, I can live with it: I can just throw a bunch of files at it and let it run overnight. Faster is always nicer, of course.
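
For the "throw a bunch of files at it overnight" part, something like the rough Python sketch below might do. It is only a sketch under assumptions: the thread does not name the tool's command line, so the transcription command is passed in as a function (`make_cmd`, a made-up name), and the one-`.txt`-per-audio-file layout and the extension list are also assumptions. It skips files that already have a transcript, so an interrupted run can simply be restarted.

```python
import subprocess
from pathlib import Path

# Assumed set of audio extensions; adjust to whatever the files actually are.
AUDIO_EXTS = {".wav", ".mp3", ".flac", ".ogg", ".m4a"}


def pending_files(in_dir, out_dir):
    """Return audio files in in_dir that have no .txt transcript in out_dir yet."""
    in_dir, out_dir = Path(in_dir), Path(out_dir)
    return sorted(
        p for p in in_dir.iterdir()
        if p.suffix.lower() in AUDIO_EXTS
        and not (out_dir / (p.stem + ".txt")).exists()
    )


def run_batch(in_dir, out_dir, make_cmd):
    """Run make_cmd(audio_path, transcript_path) for each pending file, one at a time.

    make_cmd is a caller-supplied (hypothetical) function returning the argument
    list for whatever speech-to-text program is installed.
    Returns the list of files whose command exited with a non-zero status.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for audio in pending_files(in_dir, out_dir):
        target = out_dir / (audio.stem + ".txt")
        result = subprocess.run(make_cmd(audio, target))
        if result.returncode != 0:
            # Drop any partial output so a rerun retries this file.
            target.unlink(missing_ok=True)
            failed.append(audio)
    return failed


# Example call with a placeholder command name (not a real program):
# run_batch("recordings", "transcripts",
#           lambda audio, txt: ["my-stt-tool", str(audio), str(txt)])
```

Running the files one at a time is deliberate here: on an old CPU that is already slower than real time, running several jobs in parallel would probably just make them compete for the same cores.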