[HN Gopher] Show HN: Open-source real-time transcription playgro...
___________________________________________________________________
Show HN: Open-source real-time transcription playground (React,
Python, GCP)
Author : saharhash
Score : 57 points
Date : 2021-07-07 15:41 UTC (7 hours ago)
(HTM) web link (github.com)
(TXT) w3m dump (github.com)
| Johnyma22 wrote:
| You might also like Etherpad's implementation:
| https://blog.etherpad.org/2012/06/04/plugin-spotlight-speech...
| saharhash wrote:
| The plugin URL doesn't seem to work anymore.
| [deleted]
| dvfjsdhgfv wrote:
| I like these projects but they all have one flaw: the "open
| source" part refers only to the easy frontend, not to the backend
| doing the heavy lifting. With time, chances of CMUSphinx
| succeeding are getting lower because everybody is using
| proprietary APIs.
| saharhash wrote:
| That's a great idea for the next engine to add. The plan is to
| support multiple engines, not only Google's Speech API.
| user_7832 wrote:
| If anyone is interested, Mozilla's DeepSpeech
| (https://github.com/mozilla/DeepSpeech) is also quite a solid
| open source offline alternative, and runs quite well even on low-
| end hardware like a Raspberry Pi.
| saharhash wrote:
| Agreed, though what I was mainly missing with DeepSpeech was
| Speaker Diarization, which Google Speech API supports.
| trowngon wrote:
| The DeepSpeech project was closed by Mozilla and its developers
| were fired. They are now Coqui.
| hheikinh wrote:
| As an alternative to Google Cloud Speech check out
| https://www.speechly.com/. A lot easier to get started with and
| doesn't require a credit card. Has support for all the major
| browsers, both desktop and mobile. If you are interested in a
| note-taking CRM application, this demo is worth watching:
| https://www.youtube.com/watch?v=6GcgPcMOuQk
| sairamkunala wrote:
| Voice to Text via https://speechtyping.com/voice-to-text-english
|
| I bookmarked this a while ago. It may not be open source, and it
| does not use the internet to convert. It would work for many use
| cases, but won't be as good as Google's AI.
|
| It works with just the browser, with the help of
| webkitSpeechRecognition, which is built into the browser. It
| works on both Chrome and Safari (not Firefox).
___________________________________________________________________
(page generated 2021-07-07 23:01 UTC)