Designing Noise-Resistant Ethiopian Spoken Languages Identification Model Using Machine Learning Approach

Getye, Demil Derese

dc.contributor.author	Getye, Demil Derese
dc.date.accessioned	2021-10-14T06:07:30Z
dc.date.available	2021-10-14T06:07:30Z
dc.date.issued	2020-08-26
dc.identifier.uri	http://ir.bdu.edu.et/handle/123456789/12740
dc.description.abstract	Spoken language identification is the process of deciding which language a speaker is speaking. Spoken language identification is used as a front-end processing in human-computer interaction, speech to text translation, speech to speech translation, and automatic caller routing to the intended operator. Lots of studies on spoken language identification were done using a Gaussian mixture model, i-vector, and neural network approaches. However, a Gaussian mixture model and i-vector approaches are not robust in the noise environment. Even though a deep neural network has better performance in short utterance, it is computationally expensive. In order to overcome these problems, we propose a noise-resistant Ethiopian spoken language identification model for Amharic, Tigrigna, Oromia, and Somalia languages. For the dataset, we have used a noisy data from meetings, discussions, conferences, and reports. Since back propagation neural network are slow, we proposed a feed-forward neural network and convolutional neural network based models. In the first model, an acoustic feature with a feed-forward neural network classifier was used. In this method, we compared five acoustic features and we found a better accuracy of 88% with delta Mel frequency cepstral coefficient. The second method we used an end to end convolutional neural network and convolutional neural network with a support vector machine. We found an accuracy of 98% in the end to end convolutional neural network and 97% in the convolutional neural network with support vector machine. So, the support vector machine can improve the training time of the convolutional neural network without significantly degrading the accuracy.	en_US
dc.language.iso	en_US	en_US
dc.subject	INFORMATION TECHNOLOGY	en_US
dc.title	Designing Noise-Resistant Ethiopian Spoken Languages Identification Model Using Machine Learning Approach	en_US
dc.type	Thesis	en_US