How to convert .wav audio to text using Python, with a timestamp for each word

I want the speech recognition to attach a timestamp to each word. How can I achieve that? https://code.sololearn.com/cG0j0Ik4qxoz/?ref=app

python3

4th Oct 2020, 1:27 PM

World Friend's

2 Answers

The best approach seems to be to recognize reasonably short snippets of the audio, so you find out roughly when things are said. For example, you can pass each 10-second interval to the speech recognition function, and you'll know that whatever was recognized was said somewhere in that 10-second interval. This gives you a timestamp per snippet rather than per word, so shorter snippets give tighter timestamps. The following article shows that you can specify duration=10 when recording from a microphone: https://stackabuse.com/introduction-to-speech-recognition-with-python/ You could chop up a .wav file into short segments in a similar way.
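Here is a rough sketch of the chopping step, using only Python's standard wave module. The names split_wav, transcribe_with_timestamps and the recognize parameter are made up for this example; recognize stands for whatever speech recognition function you use, and I'm assuming it accepts the bytes of a complete WAV file and returns text. Treat it as a starting point, not a finished solution.

```python
import io
import wave


def split_wav(path, segment_seconds=10.0):
    """Yield (start_sec, end_sec, wav_bytes) for consecutive segments of a WAV file.

    Each wav_bytes value is a complete in-memory WAV file (header included).
    """
    with wave.open(path, "rb") as src:
        channels = src.getnchannels()
        sample_width = src.getsampwidth()
        rate = src.getframerate()
        frames_per_segment = int(rate * segment_seconds)
        start_frame = 0
        while True:
            frames = src.readframes(frames_per_segment)
            frames_read = len(frames) // (channels * sample_width)
            if frames_read == 0:
                break  # reached the end of the audio
            buffer = io.BytesIO()
            with wave.open(buffer, "wb") as dst:
                dst.setnchannels(channels)
                dst.setsampwidth(sample_width)
                dst.setframerate(rate)
                dst.writeframes(frames)
            end_frame = start_frame + frames_read
            yield start_frame / rate, end_frame / rate, buffer.getvalue()
            start_frame = end_frame


def transcribe_with_timestamps(path, recognize, segment_seconds=10.0):
    """Return a list of (start_sec, end_sec, text), one entry per segment.

    `recognize` is a hypothetical callable: WAV bytes in, text out.
    """
    return [
        (start, end, recognize(chunk))
        for start, end, chunk in split_wav(path, segment_seconds)
    ]
```

If you use the SpeechRecognition package, you may not need to split the file yourself: as far as I know, Recognizer.record() accepts offset and duration arguments, so you can read one interval at a time from an AudioFile source. One weakness of either approach is that a word can be cut in half at a segment boundary, so overlapping the segments slightly might help.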

6th Oct 2020, 5:48 AM

Josh Greig