BDU IR

MULTI-LABEL CLASSIFICATION FOR OPTIONATED AMHARIC TEXTS USING DEEP LEARNING METHODS

Show simple item record

dc.contributor.author ZAKIR, KASSA HASSEN
dc.date.accessioned 2023-06-19T07:26:21Z
dc.date.available 2023-06-19T07:26:21Z
dc.date.issued 2023-02
dc.identifier.uri http://ir.bdu.edu.et/handle/123456789/15396
dc.description.abstract Text classification is a method for automatically assigning the best possible set of labels to a piece of text. Texts can be a highly rich source of information, but because they are unstructured, it can be challenging and time-consuming to find what we need. Texts are now classified using machine learning as multiple labels in order to solve this problem. One of this text is a toxic comment, which is a comment made during online contact that is unpleasant, insulting, or unfair and that prompts the user to stop using the page they are on and leave. The thesis provides an optionated text classification model based on deep learning techniques for classifying optionated texts and performing suitable measures for managing the negative effects caused on social media sites. The majority of previous research has been focused on single-level issues that only take into account mutually exclusive solutions. So, the major goal of our research is to use deep learning techniques to build a multi-label classification model for optionated Amharic texts. We build a multi-label text classification model for optionated Amharic text using deep learning methods and which are trained using 8k optionated Amharic texts. The model accepts optionated texts as input and classifies them into specified labels or categories based on the label of toxicity based on their content. This research uses a number of text processing tasks: - tokenization, normalization, stop word removal and stemming are the stages that we have applied for our dataset for text preparation. Finally, we have built word2vec and the experimental result of multi-label optionated Amharic texts classification models shows that, CNN, LSTM, Bi-LSTM and hybrid CNN-BiLSTM scores an accuracy of 91%, 85%, 90% and 94% respectively, is achieved by the proposed model. Based on the experimental result we propose the hybrid CNNBiLSTM model which has better classification accuracy, because of its better ability at handling context by combining the CNN and Bi-LSTM models. So, to improve the performance of the classification model for optionated Amharic texts we recommend as future research works the need to prepare standard dataset for experimentation. Key Word: Deep learning method, optionated text, multi-Label classification en_US
dc.language.iso en_US en_US
dc.subject Computing en_US
dc.title MULTI-LABEL CLASSIFICATION FOR OPTIONATED AMHARIC TEXTS USING DEEP LEARNING METHODS en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record