October 7, 2019 feature

Speech recognition using artificial neural networks and artificial bee colony optimization

by Ingrid Fadelli , Tech Xplore

Over the past decade or so, advances in machine learning have paved the way for the development of increasingly advanced speech recognition tools. By analyzing audio files of human speech, these tools can learn to identify words and phrases in different languages, converting them into a machine-readable format.

While several machine learning-based models have achieved promising results on speech recognition tasks, they do not always perform well in all languages. For instance, when a language has a vocabulary with many similar-sounding words, the performance of speech recognition systems can decline considerably.

Researchers at Mahatma Gandhi Mission's College of Engineering & Technology and Jaypee Institute of Information Technology, in India, have developed a speech recognition system to tackle this problem. This new system, presented in a paper published in Springer Link's International Journal of Speech Technology, combines an artificial neural network (ANN) with an optimization technique known as opposition artificial bee colony (OABC).

"In this work, the default structure of ANNs is redesigned using the Levenberg-Marquardt algorithm to retrieve an optimal prediction rate with accuracy," the researchers wrote in their paper. "The hidden layers and neurons of the hidden layers are further optimized using the opposition artificial bee colony optimization technique."

A unique characteristic of the system developed by the researchers is that it uses an OABC optimization algorithm to optimize the ANN's layers and artificial neurons. As the name would suggest, artificial bee colony (ABC) algorithms are designed to simulate the behavior of honey bees to tackle a variety of optimization problems.

"Generally, optimization algorithms randomly initialize the solutions in the matching domain," the researchers explained in their paper. "But this solution could lie in the opposite direction of the best solution, thereby increasing the computational overhead significantly. Hence this opposition-based initialization is termed as OABC."

The system developed by the researchers considers individual words spoken by different people as an input speech signal. Subsequently, it extracts so-called amplitude modulation (AM) spectrogram features, which are essentially sound-specific characteristics.

The features extracted by the model are then used to train the ANN to recognize human speech. After it is trained on a large database of audio files, the ANN learns to predict isolated words in new samples of human speech.

The researchers tested their system on a series of human speech audio clips and compared it with more conventional speech recognition techniques. Their technique outperformed all the other methods, attaining remarkable accuracy scores.

"The sensitivity, specificity, and accuracy of the proposed method are 90.41 percent, 99.66 percent and 99.36 percent, respectively, which is better than all the existing methods," the researchers wrote in their paper.

In the future, the speech recognition system could be used to achieve more effective human-machine communication in a variety of settings. In addition, the approach they used to develop the system could inspire other teams to design similar models, which combine ANNs and OABC optimization techniques.

More information: Shilpi Shukla et al. A novel system for effective speech recognition based on artificial neural network and opposition artificial bee colony algorithm, International Journal of Speech Technology (2019). DOI: 10.1007/s10772-019-09639-0

Citation: Speech recognition using artificial neural networks and artificial bee colony optimization (2019, October 7) retrieved 24 April 2024 from https://techxplore.com/news/2019-10-speech-recognition-artificial-neural-networks.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Emotion recognition based on paralinguistic information

186 shares

Feedback to editors

Study shows potential of super grids when hurricanes overshadow solar panels

37 minutes ago

Rubber-like stretchable energy storage device fabricated with laser precision

45 minutes ago

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

57 minutes ago

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

2 hours ago

Why can't robots outrun animals?

3 hours ago

Virtual sensors help aerial vehicles stay aloft when rotors fail

3 hours ago

New insights lead to better next-gen solar cells

4 hours ago

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

4 hours ago

Going with the flow: Research dives into electrodes on energy storage batteries

4 hours ago

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

4 hours ago

Load comments (0)

Speech recognition using artificial neural networks and artificial bee colony optimization

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Going with the flow: Research dives into electrodes on energy storage batteries

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

Emotion recognition based on paralinguistic information

Only few hundred training samples bring human-sounding speech in Microsoft TTS feat

Can you hear what I say? New findings on human speech recognition

A deep learning technique for context-aware emotion recognition

Google Brain posse takes neural network approach to translation

An approach for securing audio classification against adversarial attacks

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Emulating neurodegeneration and aging in artificial intelligence systems

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

With a game show as his guide, researcher uses AI to predict deception

Phys.org

Medical Xpress

Science X

Speech recognition using artificial neural networks and artificial bee colony optimization

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Going with the flow: Research dives into electrodes on energy storage batteries

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

Related Stories

Emotion recognition based on paralinguistic information

Only few hundred training samples bring human-sounding speech in Microsoft TTS feat

Can you hear what I say? New findings on human speech recognition

A deep learning technique for context-aware emotion recognition

Google Brain posse takes neural network approach to translation

An approach for securing audio classification against adversarial attacks

Recommended for you

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Emulating neurodegeneration and aging in artificial intelligence systems

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

With a game show as his guide, researcher uses AI to predict deception

Your Privacy