Exploring the Performance Characteristics of the Naïve Bayes Classifier in the Sentiment Analysis of an Airline’s Social Media Data

Exploring the Performance Characteristics of the Naïve Bayes Classifier in the Sentiment Analysis of an Airline’s Social Media Data

Volume 5, Issue 4, Page No 266-272, 2020

Author’s Name: Mba Obasi Odima), Adewale Opeoluwa Ogunde, Bosede Oyenike Oguntunde, Samuel Ayodele Phillips

View Affiliations

Department of Computer Science, Redeemer’s University, Ede 23210, Nigeria

a)Author to whom correspondence should be addressed. E-mail: odimm@run.edu.ng

Adv. Sci. Technol. Eng. Syst. J. 5(4), 266-272 (2020); a  DOI: 10.25046/aj050433

Keywords: Airline image branding, Naïve Bayes, Sentiment analysis

Share
277 Downloads

Export Citations

Airline operators get much feedback from their customers which are vital for both operational and strategic planning. Social media has become one of the most popular platforms for obtaining such feedback. However, to analyze, categorize, and generate useful insight from the huge quantity of data on social media is not a trivial task. This study investigates the capability of the Naïve Bayes classifier for analyzing sentiments of airline image branding. It further examines the impact of data size on the accuracy of the classifier. We collected data about some online conversations relating to an incident where an airline’s security operatives roughly handled a passenger as a case study. It was reported that the incident resulted in a loss of about $1 billion of the company’s corporate value. Data were extracted from twitter, preprocessed and analyzed using the Naïve Bayes Classifier. The findings showed a 62.53% negative and 37.47% positive sentiments about the incident with a classification accuracy of over 0.97. To assess the impact of training size on the accuracy of the classifier, the training sets were varied into different sizes. A direct linear relationship between the training size and the classifier’s accuracy was observed. This implies that large training data sets have the potentials for increasing the classification accuracy of the classifier. However, it was also observed that a continuous increase in the classification size could lead to overfitting. Hence there is a need to develop mechanisms for determining optimum training size for finest accuracy of the classifier. The negative perceptions of customers could have a damaging effect on a brand and ultimately lead to a catastrophic loss in the organization.

Received: 20 April 2020, Accepted: 08 July 2020, Published Online: 28 July 2020

  1. P. Greenberg, CRM at the Speed of Light, Fourth Edition: Social CRM 2.0 Strategies, Tools, and Techniques for Engaging Your Customers, 4th ed., New York City: McGraw-Hill Education, 2009.
  2. B. Liu, Sentiment Analysis and Opinion Mining, Virginia: Morgan & Claypool Publishers, 2012.
  3. S. Gupta, “Sentiment Analysis: Concept, Analysis and Applications,” 2018.
  4. A. Tripathy, A. Agrawal and S. K. Rath, “Classification of Sentimental Reviews Using Machine Learning Techniques,” Procedia Computer Science, 57, 821 – 829, 2015. https://doi:org/10.1016/j.procs.2015.07.523
  5. M. Salam, “Security Officers Fired for United Airlines Dragging Episode,” The New York Times, 17 October 2017.
  6. I. Chaturvedi, E. Cambria, R. E. Welsch and F. Herrera, “Distinguishing between facts and opinions for sentiment analysis: Survey and Challenges,” Information Fusion, 44, 65 – 77, 2018. https://doi.org/10.1016/j.inffus.2017.12.006
  7. A. Tamilselvi and M. ParveenTaj, “Sentiment Analysis of Microblogs using Opinion Mining Classification Algorithm,” International Journal of Science and Research, 2 (10), 196 – 202, 2013.
  8. V. A. Kharde and S. S. Sonawane, “Sentiment Analysis of Twitter Data: A Survey of,” International Journal of Computer Applications, 139(11), 5-15, 2016. https://doi.org/10.5120/ijca2016908625.
  9. L. Zhanga, K. Huac, H. Wangd, G. Qiane and L. Zhanga, “Sentiment Analysis on Reviews of Mobile Users,” Procedía Computer Science, 34, 458 – 465, 2014. https://doi.org/10.1016/j.procs.2014.07.013
  10. L. Martin-Domingoa, J. C. Martínb and G. Mandsberg, “Social media as a resource for sentiment analysis of Airport Service Quality (ASQ),” Journal of Air Transport Management, 78, 106-115, 2019. https://doi.org/10.1016/j.jairtraman.2019.01.004
  11. G. Vinodhini and R. Chandrasekaran, “A comparative performance evaluation of neural network-based approach for sentiment classification of online reviews,” Journal of King Saud University – Computer and Information Sciences, 28, 2–12, 2016. https://doi.org/10.1016/j.jksuci.2014.03.024
  12. Y. AL Amrani, M. Lazaar and K. E. EL Kadiri, “Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis,” Procedia Computer Science, p. 511–520, 2018. https://doi.org/10.1016/j.procs.2018.01.150
  13. M. D. Devika, C. Sunitha and A. Ganesh, “Sentiment Analysis: A Comparative Study on Different Approaches,” Procedia Computer Science, 87, 2016. https://doi.org/10.1016/j.procs.2016.05.124
  14. P. Shahana and B. Omman, “Evaluation of Features on Sentimental Analysis,” Procedia Computer Science, 46, (2015), 1585 – 1592, 2015. https://doi:org/10.1016/j.procs.2015.02.088
  15. K. Mehmood, D. Essam, K. Shafi and M. K. Malik, “Sentiment Analysis for a Resource Poor Language—Roman Urdu,” ACM Transactions on Asian and Low-Resource Language Information Processing, 19(1), 10.1-10.15, 2019. https://doi:org/10.1145/3329709
  16. M. O. Odim and V. C. Osamor, “Required Bandwidth Capacity Estimation Scheme for Improved Internet Service Delivery: A Machine Learning Approach,” International Journal of Scientific & Technology Research, 8(8), 326 – 334, 2019.
  17. M. O. Odim, J. A. Gbadeyan and J. S. Sadiku, “Modelling the Multi-Layer Artificial Neural Network for Internet Traffic Forecasting: The Model Selection Design Issues,” in ACM Computing Research and Innovations (CoRI 2016), Ibadan, 2016.

Citations by Dimensions

Citations by PlumX

Google Scholar

Scopus