BDU IR

ANNOTATION AND CLASSIFICATION OF EXTREMIST ATTITUDE POSTS AND COMMENTS TEXT ON SOCIAL MEDIA IN AFAN OROMO USING SENTIMENT ANALYSIS

Show simple item record

dc.contributor.author ABRAHAM, EJIGU TAMENE
dc.date.accessioned 2024-12-05T07:13:48Z
dc.date.available 2024-12-05T07:13:48Z
dc.date.issued 2024-03
dc.identifier.uri http://ir.bdu.edu.et/handle/123456789/16270
dc.description.abstract Extremist attitudes on social media posts and comments are linked with other groups that propagate an individual, group, city, or country. This is a major issue. In the present day, Twitter, Facebook, YouTube, Instagram, TikTok, LinkedIn, and others are commonly used as social media platforms. Opinions expressed on such social media are a great problem, as is the spread of extremist ideas. Some of the information on these social media platforms is considered negative because it can be used to attack and insult people. We propose a machine learning approach to build a model for the classification of extremist attitude posts and comments on social media in Afan Oromo to categorize them as politics, religious, ethnic, race, and neutral. We have experimented with 14500 collected data from Facebook of BBC, OBN, FBC, OLF, KFO, and Taye of Afan Oromo pages using the Face Pager tool. From this dataset, we used train data of 11600, and for test data, we used 2900. We applied text processing tasks like lower case conversion, link removal, email removal, number removal, special character removal, short word removal, tokenization, and data annotation to the collected data. Then, for feature extraction, we used a counter vector and TF-IDF. Finally, we applied seven machine learning algorithms (RFC, SVM, LG, NB, KNC, DTC, and NN) to 80% of the train data and 20% of the test data to build a model to classify extremist attitude posts and comments on social media in Afan Oromo into categories such as politics, religious, race, ethnicity, or neutral. From these seven models, the Random Forest Classifier (RFC) with counter-vector feature extraction scored the highest performance. The model's performance was also evaluated using a performance test dataset, which yielded an f1 score of 95%. So, it is appropriate for the classification of extremist attitude posts and comments in Afan Oromo from social media to categories such as politics, religious, race, ethnic, or neutral extremist. Keywords: Afan Oromo, extremist attitude, extremism, sentiment analysis, deep learning, classification, Twitter, Facebook, annotation. en_US
dc.language.iso en_US en_US
dc.subject Computer Science en_US
dc.title ANNOTATION AND CLASSIFICATION OF EXTREMIST ATTITUDE POSTS AND COMMENTS TEXT ON SOCIAL MEDIA IN AFAN OROMO USING SENTIMENT ANALYSIS en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record