Speech Recognition with Python 3

Voice recognition refers to the automatic recognition of human speech. Voice recognition is one of the most important tasks in the domain of human computer interaction. For example, the famous Alexa and Google home that you may have used are based on this scheme. In this article we will know step by step how to achieve speech Recognition with Python 3, but first…

What’s Speech Recognition with Python 3?

Speech recognition is a technology that allows spoken input into systems. You talk to your computer, phone, or device and it uses what you said as input to trigger some action.

Installing SpeechRecognition Library

Execute the following command to install the library:

$ pip install SpeechRecognition

Speech Recognition from Audio Files (Code)

In this code have a audio from file in the computer

import speech_recognition as sr

def audioTranscript(audio, lang):
        r = sr.Recognizer()
        with sr.AudioFile(audio) as source:
               audio_data = r.record(source)
               text = r.recognize_google(audio_data, language=lang)
         return text

audio = "./assets/test3.m4a"
lang = "es"
audio_str = audioTranscript(audio, lang)
print(audio_str)

Speech Recognition from Microphone(Code)

import speech_recognition as sr

def audioTranscript():
        r = sr.Recognizer()
        with sr.Microphone() as source:
            audio_data = r.record(source, duration=5)
            text = r.recognize_google(audio_data)
            return text

audio_str = audioTranscript()
print(audio_str)

To convert speech to text, the only class we need is the Recognizer class of the speech_recognition module. Depending on the underlying API used to convert speech to text, the Recognizer class has the following methods:

Recognize_bing (): use API de Microsoft Bing Speech
Recognize_google (): use API de Google Speech
Recognize_google_cloud (): use API de Google Cloud Speech
Recognize_houndify (): use API Houndify de SoundHound
Recognize_ibm (): use API de voz a texto de IBM
Recognize_sphinx (): use API de PocketSphinx

Conclusion

Voice recognition has several useful applications in the domain of human computer interaction and automatic voice transcription.

One point to note is that if you are going to use voice recognition in a commercial way is that you use a key of the service that you are going to use in order to avoid that your service is interrupted.

If you want to learn another kind of recognition this article may interest you face recognition

What’s Speech Recognition with Python 3?

Installing SpeechRecognition Library

Speech Recognition from Audio Files (Code)

Speech Recognition from Microphone(Code)

Conclusion

Luis Estrada

Setting up Python environments in OSX

Python - Face recognition from scratch

Speech Recognition with Python 3

What’s Speech Recognition with Python 3?

Installing SpeechRecognition Library

Speech Recognition from Audio Files (Code)

Speech Recognition from Microphone(Code)

Conclusion

Luis Estrada

Related Posts

Setting up Python environments in OSX

Python - Face recognition from scratch