Ogunfolajin Maruff , Tunde (2022) Adoption of machine learning algorithm for analysing supporters and non supporters feedback on political posts / Ogunfolajin Maruff Tunde. Masters thesis, Universiti Malaya.
PDF (The Candidate's Agreement) Restricted to Repository staff only Download (219Kb) | |
PDF (Thesis M.A.) Download (715Kb) |
Abstract
Sentiment Analysis is a field that deals with the problem of identifying and extracting sentiment (or opinion) from data (particularly textual data). Studies have shown how user perception can have a strong influence on policies and decision-making processes in a place, society, and nation. This thesis is based on the application of sentiment classification algorithm to tweet data with the goal of classifying messages based on the polarity of sentiment towards a particular topic (or subject matter). Political analysts often communicate with the public and exchange information through the social media platform. Their activities (otherwise termed cyber-trooping) could have either positive, negative, or neutral feedbacks (perceptions) in the public space. Thus, there is a need to automate the process of identifying and predicting (positive, negative, or neutral class) these cyber-trooping data. This work employed the use of machine learning approach. Four conventional classification algorithms: naïve bayes (NB), support vector machines (SVM), nearest neighbor (k-NN), and decision trees (J48) classifiers are implemented in identifying and categorizing tweet data of three political figures in Malaysia: Dato Seri Anwar, Dato Hadi Awang, and Lim Guang Eng, as either positive, negative, or neutral perceptions. The method was implemented using Java and the results of the simulation were evaluated using five standard performance metrics: accuracy, AUC, precision, recall, and f-Measure. The support vector machines (SVM) algorithm obtained the overall best results of 94.5% accuracy, 91.8% precision, 91.7% recall, and 91.1% f-Measure while the naïve bayes (NB) algorithm obtained the best AUC score of 0.944 with the tweet data of Dato Seri Anwar.
Item Type: | Thesis (Masters) |
---|---|
Additional Information: | Dissertation (M.A.) – Faculty of Computer Science & Information Technology, Universiti Malaya, 2022. |
Uncontrolled Keywords: | Cyber-trooper; Perception, Sentiment; Twitter; Algorithms; Naïve bayes; Support vector machine, Nearest neighbor; Decision trees |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QA Mathematics > QA76 Computer software |
Divisions: | Faculty of Computer Science & Information Technology > Dept of Software Engineering |
Depositing User: | Mr Mohd Safri Tahir |
Date Deposited: | 07 Oct 2024 06:13 |
Last Modified: | 07 Oct 2024 06:13 |
URI: | http://studentsrepo.um.edu.my/id/eprint/15304 |
Actions (For repository staff only : Login required)
View Item |