Fake news detection based on a hybrid BERT and LightGBM models.

Ehab Essa, Karima Omar, Ali Alqahtani
Author Information
  1. Ehab Essa: Department of Computer Science, Faculty of Computer and Information Sciences, Mansoura University, Mansoura, 35516 Egypt. ORCID
  2. Karima Omar: Department of Computer Science, Faculty of Computer and Information Sciences, Mansoura University, Mansoura, 35516 Egypt.
  3. Ali Alqahtani: Department of Computer Science, King Khalid University, 61421 Abha, Saudi Arabia.

Abstract

With the rapid growth of social networks and technology, knowing what news to believe and what not to believe become a challenge in this digital era. Fake news is defined as provably erroneous information transmitted intending to defraud. This kind of misinformation poses a serious threat to social cohesion and well-being, since it fosters political polarisation and can destabilize trust in the government or the service provided. As a result, fake news detection has emerged as an important field of study, with the goal of identifying whether a certain piece of content is real or fake. In this paper, we propose a novel hybrid fake news detection system that combines a BERT-based (bidirectional encoder representations from transformers) with a light gradient boosting machine (LightGBM) model. We compare the performance of the proposed method to four different classification approaches using different word embedding techniques on three real-world fake news datasets to validate the performance of the proposed method compared to other methods. The proposed method is evaluated to detect fake news based on the headline-only or full text of the news content. The results show the superiority of the proposed method for fake news detection compared to many state-of-the-art methods.

Keywords

Word Cloud

Created with Highcharts 10.0.0newsfakedetectionproposedmethodLightGBMsocialbelieveFakecontenthybridperformancedifferentcomparedmethodsbasedBERTrapidgrowthnetworkstechnologyknowingbecomechallengedigitaleradefinedprovablyerroneousinformationtransmittedintendingdefraudkindmisinformationposesseriousthreatcohesionwell-beingsincefosterspoliticalpolarisationcandestabilizetrustgovernmentserviceprovidedresultemergedimportantfieldstudygoalidentifyingwhethercertainpiecerealpaperproposenovelsystemcombinesBERT-basedbidirectionalencoderrepresentationstransformerslightgradientboostingmachinemodelcomparefourclassificationapproachesusingwordembeddingtechniquesthreereal-worlddatasetsvalidateevaluateddetectheadline-onlyfulltextresultsshowsuperioritymanystate-of-the-artmodelsFakeNewsTransformers

Similar Articles

Cited By (2)