Voice recognition in python

Asked

Viewed 8,802 times

-2

I wanted suggestions from recognition libraries (when you speak and the computer writes) of voice in Python, which are windows compatible, easy to download and of course in Python 3

  • My question is about voice recognition (when you speak and the computer writes) n of voice synthesis(you write and the computer speaks)

  • 2

    I voted to close the question as outside the scope because, in my view, the answer would only be a link or would be too broad to answer. Anyway, in search of python Speech to text on Google you already find material to start your studies.

  • @Mat4guide my lack of attention, I withdrew the closing vote and answered.

  • obg @Guilhermenascimento

  • OK, @Andersoncarloswoss,and thank you for answering my question.

1 answer

3

There is the Speechrecognition supporting the following Apis/engines:

To install use the pip, if you have not installed download it, see this link. After installing Pip run this command, make sure you have the PyAudio (is one of the necessary resources) installing if you have not run the command:

python -m pip install pyaudio

Then then run:

pip install SpeechRecognition

After installed both you can test using the command:

python -m speech_recognition

Test example with Pyaudio:

C:\Users\guilherme\Desktop>python -m speech_recognition
A moment of silence, please...
Set minimum energy threshold to 257.2594761232556
Say something!
Got it! Now to recognize it...
You said hello stack overflow
Say something!
  • Say something! was the moment the software started and then I spoke into the microphone "hello stack overflow"
  • Got it! Now to recognize it... occurred when he understood what I said
  • You said informs what was said on the microphone

Requirements and resources needed to function

  • Python 2.6, 2.7 or 3.3+
  • Pyaudio 0.2.11+ (required for the microphone, which seems to be your case)
  • Pocketsphinx (required only if using Sphinx recognizer, recognizer_instance.recognize_sphinx)
  • Google API Client Library for Python (required only if using Google Cloud Speech API, recognizer_instance.recognize_google_cloud)
  • FLAC Encoder (required only on non-x86-based systems - Windows/Linux/OS X)

Examples

Examples can be downloaded at https://github.com/Uberi/speech_recognition/tree/master/examples

Have example for microphone or read audio file


API references

Follow the link with the API references https://github.com/Uberi/speech_recognition/blob/master/reference/library-reference.rst

As soon as possible I will list all here in Portuguese

Browser other questions tagged

You are not signed in. Login or sign up in order to post.