Deep Learning Model for COVID-19 Sentiment Analysis on Twitter.

Salvador Contreras Hernández, María Patricia Tzili Cruz, José Martín Espínola Sánchez, Angélica Pérez Tzili
Author Information
  1. Salvador Contreras Hernández: Department of Informatics, Universidad Politécnica del Valle de México, 54910 Tultitlán Estado de México, Mexico. ORCID
  2. María Patricia Tzili Cruz: Department of Informatics, Universidad Politécnica del Valle de México, 54910 Tultitlán Estado de México, Mexico. ORCID
  3. José Martín Espínola Sánchez: Department of Informatics, Universidad Politécnica del Valle de México, 54910 Tultitlán Estado de México, Mexico. ORCID
  4. Angélica Pérez Tzili: School of Medicine, Instituto Politécnico Nacional, 07738 Mexico City, Mexico. ORCID

Abstract

The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.

Keywords

References

  1. Cognit Comput. 2021;13(5):1154-1171 [PMID: 34306241]
  2. Soc Netw Anal Min. 2021;11(1):33 [PMID: 33758630]
  3. Neural Comput Appl. 2022;34(23):20449-20461 [PMID: 34316097]
  4. SN Comput Sci. 2021;2(3):201 [PMID: 33851137]
  5. Int J Inf Technol. 2021;13(4):1555-1567 [PMID: 33778365]
  6. Soc Netw Anal Min. 2021;11(1):66 [PMID: 34335992]
  7. Multimed Tools Appl. 2022;81(5):6131-6157 [PMID: 35018132]
  8. IEEE J Biomed Health Inform. 2020 Oct;24(10):2733-2742 [PMID: 32750931]
  9. Health Inf Sci Syst. 2021 Jun 25;9(1):25 [PMID: 34188896]
  10. J Big Data. 2021;8(1):95 [PMID: 34249602]
  11. BioData Min. 2021 Mar 29;14(1):21 [PMID: 33781293]
  12. Multimed Tools Appl. 2022;81(4):5587-5620 [PMID: 34975284]
  13. Cognit Comput. 2016;8:757-771 [PMID: 27563360]
  14. Inf Syst Front. 2021;23(6):1417-1429 [PMID: 33897274]
  15. Arab J Sci Eng. 2022;47(2):2499-2511 [PMID: 34660170]
  16. Artif Intell Med. 2019 Jan;93:50-57 [PMID: 29685725]
  17. J Comput Soc Sci. 2020;3(2):319-342 [PMID: 33163686]
  18. Nucleic Acids Res. 2018 Jul 2;46(W1):W523-W529 [PMID: 29788413]
  19. Soc Netw Anal Min. 2021;11(1):81 [PMID: 34484462]
  20. Multimed Tools Appl. 2021;80(8):11765-11788 [PMID: 33432264]
  21. Cognit Comput. 2022;14(1):274-299 [PMID: 34422122]

Word Cloud

Created with Highcharts 10.0.0COVID-19dataTwitterSpanishusedmodelTransformerssocialpublicationsinformationmeasurenetworkMexicomodelstrainedsentimentanalysisadditionlanguagesetbasedSentimentpandemicimpactedmoodpeopleevidentnetworkscommonusersourcepopulation'sopinionphenomenaparticularrepresentsresourcegreatvaluedueamountgeographicaldistributionopennessdisposeworkpresentsstudyfeelingspopulationonewavesproducedcontagiondeathscountrymixedsemi-supervisedapproachlexical-basedlabelingtechniquelaterbringpre-trainedcompletelyTwoSpanish-languageaddingneuraladjustmenttaskspecificallytenmultilanguageTransformerincludingparameterscompareperformanceclassifierstrainingtestingSupportVectorMachinesNaiveBayesLogisticRegressionDecisionTreesperformancescomparedexclusivehigherprecisionFinallydevelopedexclusivelynewcommunityDeepLearningModelAnalysisBERT

Similar Articles

Cited By