feeds.twtxt.net

Best Speech-to-text API with speaker diarization?
Whisper does not offer speaker diarization, so we’re looking to migrate to another API. We’re having trouble finding one that matches Whisper’s quality and also supports speaker diarization.

So far, we’ve tried Google’s Speech-to-Text and Azure’s speech-to-text, but both are less accurate than Whisper and struggle with custom phrases.

We’re working with large audio files (>30 minutes, >25 MB), but we can change how we split up the files and which file types we use. All of our files are in Google Cloud Storage ri …