Combination of Lexicon Based and Machine Learning Techniques in the Development of Political Tweet Sentiment Analysis Model

  • Liyana Safra Zaabar
  • Mohd Ridzwan Yaakub
  • Muhammad Iqbal Abu Latiffi
Keywords: Lexicon Based Approach, Sentiment Analysis, Opinion Mining, Twitter, Machine Learning, Feature Extraction, TF-IDF, Naive Bayes, Political Tweet

Abstract

Twitter is a popular microblogging social media platform and the largest contributor of data to the analysis of political sentiment in the United States, especially during presidential elections. The lack of labeled data and the requirements for testing data are major problems in the political domain, because the data change constantly with current events. The contribution of this study is a comparison of two dictionary-based lexicon approaches, the Bing Liu Opinion Lexicon and TextBlob, for labelling tweets. Several comparative models were developed. The model labelled with the Bing Liu Opinion Lexicon, which used TF-IDF for feature extraction and Naïve Bayes for classification, achieved the highest F1-score at 93%, outperforming our baseline model, which scored only 68%. The test results show the effectiveness of combining lexicon approaches with machine learning algorithms in developing a sentiment analysis model.
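The pipeline described in the abstract (lexicon-based labelling, then TF-IDF features, then a Naïve Bayes classifier) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the word lists are a tiny hypothetical stand-in for the Bing Liu Opinion Lexicon, the example tweets are invented, and the function names (lexicon_label, fit_idf, tfidf, train_nb, predict) are assumptions made for illustration. The classifier is a simple multinomial Naïve Bayes that treats TF-IDF weights as fractional counts, which is one common way to combine the two techniques.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy lexicon; the real Bing Liu Opinion Lexicon is far larger.
POSITIVE = {"great", "win", "strong", "good", "support"}
NEGATIVE = {"bad", "corrupt", "weak", "fail", "lie"}


def tokenize(text):
    """Lowercase and split on whitespace (real tweets need more cleaning)."""
    return text.lower().split()


def lexicon_label(text):
    """Label a tweet by counting positive minus negative lexicon hits."""
    tokens = tokenize(text)
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def fit_idf(docs):
    """Compute a smoothed inverse document frequency for every term."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(tokenize(d)))
    return {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}


def tfidf(text, idf):
    """Map a text to {term: tf * idf}, ignoring terms unseen in training."""
    tf = Counter(t for t in tokenize(text) if t in idf)
    return {t: c * idf[t] for t, c in tf.items()}


def train_nb(docs, labels, idf, alpha=1.0):
    """Multinomial Naive Bayes over TF-IDF weights with Laplace smoothing."""
    class_counts = Counter(labels)
    weights = defaultdict(lambda: defaultdict(float))
    for doc, label in zip(docs, labels):
        for term, w in tfidf(doc, idf).items():
            weights[label][term] += w
    vocab = list(idf)
    model = {}
    for label, count in class_counts.items():
        total = sum(weights[label].values()) + alpha * len(vocab)
        log_like = {t: math.log((weights[label][t] + alpha) / total) for t in vocab}
        model[label] = (math.log(count / len(labels)), log_like)
    return model


def predict(text, model, idf):
    """Return the class with the highest log posterior score."""
    vec = tfidf(text, idf)
    scores = {
        label: prior + sum(w * log_like[t] for t, w in vec.items())
        for label, (prior, log_like) in model.items()
    }
    return max(scores, key=scores.get)


# Invented example tweets; labels come from the lexicon, not from annotators.
tweets = [
    "great win for the strong candidate",
    "good policy and strong support",
    "corrupt and weak leadership",
    "another bad lie from a weak campaign",
]
labels = [lexicon_label(t) for t in tweets]
idf = fit_idf(tweets)
model = train_nb(tweets, labels, idf)
print(labels)
print(predict("strong support", model, idf))
```

In this sketch the lexicon supplies the training labels, so no manual annotation is needed, which is the motivation the abstract gives for the lexicon step. A study-scale version would more likely rely on an established library for the TF-IDF and Naïve Bayes stages and on the full lexicon files.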

Published
18-10-2022
How to Cite
Liyana Safra Zaabar, Mohd Ridzwan Yaakub, & Muhammad Iqbal Abu Latiffi. (2022). Combination of Lexicon Based and Machine Learning Techniques in the Development of Political Tweet Sentiment Analysis Model. International Journal of Synergy in Engineering and Technology, 3(2), 72-83. Retrieved from https://www.ijset.tatiuc.edu.my/index.php/ijset/article/view/143