Voice Classification/Recognition vs Speech Recognition

Ashvini More
2 min readApr 14, 2021

--

“Hi! I am bond,James Bond!!” , Whenever we heard that phrase we remind the legend James Bond. Forever we know that voice and phrase. but this epic voice not able to understand by the Machine.

But If Machine can able to recognize you , greet you and convert your voice then, Everybody loves Siri!! right?

In Machine learning, there are numerous Speech recognition models are there. The famous and default is SpeechRecognition library.
Through some voice assistance, we could able to convert voice into text with modulations also.

pixbay.com

Here are simple and easy discrepancy between Voice Classification and Speech recognition while doing Machine Learning:

Voice Classification:

Aim : specifically classify voice into different classes with valid labels.
Prerequisites : data in the form of some voice clips of about 15 seconds each. Feature extraction of audio like spectral bandwidth,Mel-Frequency Cepstral Coefficients(MFCCs) with the help of python librosa library.
Training : Use ANN sequential algorithm to train the above images,train the model . You can use convolutional neural network also on spectogram.
These two algorithm gives accuracy almost 98%.
Finally, You can input any speech_recognition voice , Assistance will give the name of the person of voice.

Demo — Voice Classification

Speech Recognition :

Aim : convert speech into text and use that input for further use.
Prerequisites : Any audio or run time speech,Use microphone for run time speech.
Use simple speech_recognition python library with google api.
Give simple instructions to google assistance like open MS Word,search query in browser.
speech_recognition library convert command into text and feed as an instruction to our python functions.
Finally output will be print to your screen.

Demo — Speech Recognition

These are some basic steps that everyone can refer to start Voice and Speech recognition
Thank You!!

Happy Coding!!!:)

References:

1.Voice Classification : https://towardsdatascience.com/voice-classification-with-python-4bec6856d06a
2.Speech Recognition : https://realpython.com/python-speech-recognition/

--

--

Responses (1)