Ensemble Techniques for Robust Fake News Detection: Integrating Transformers, Natural Language Processing, and Machine Learning.

Mohammed Al-Alshaqi, Danda B Rawat, Chunmei Liu
Author Information
  1. Mohammed Al-Alshaqi: Department of Electrical Engineering and Computer Science, Howard University, Washington, DC 20059, USA. ORCID
  2. Danda B Rawat: Department of Electrical Engineering and Computer Science, Howard University, Washington, DC 20059, USA. ORCID
  3. Chunmei Liu: Department of Electrical Engineering and Computer Science, Howard University, Washington, DC 20059, USA. ORCID

Abstract

The proliferation of fake news across multiple modalities has emerged as a critical challenge in the modern information landscape, necessitating advanced detection methods. This study proposes a comprehensive framework for fake news detection integrating text, images, and videos using machine learning and deep learning techniques. The research employs a dual-phased methodology, first analyzing textual data using various classifiers, then developing a multimodal approach combining BERT for text analysis and a modified CNN for visual data. Experiments on the ISOT fake news dataset and MediaEval 2016 image verification corpus demonstrate the effectiveness of the proposed models. For textual data, the Random Forest classifier achieved 99% accuracy, outperforming other algorithms. The multimodal approach showed superior performance compared to baseline models, with a 3.1% accuracy improvement over existing multimodal techniques. This research contributes to the ongoing efforts to combat misinformation by providing a robust, adaptable framework for detecting fake news across different media formats, addressing the complexities of modern information dissemination and manipulation.

Keywords

References

  1. Multimed Tools Appl. 2021;80(8):11765-11788 [PMID: 33432264]
  2. Inf Process Manag. 2021 Sep;58(5):102610 [PMID: 36567974]

Grants

  1. CCF-0939370/The U.S. National Science Foundation

Word Cloud

Created with Highcharts 10.0.0fakenewsdatadetectionlearningmultimodalacrossmoderninformationframeworktextusingmachinetechniquesresearchtextualapproachBERTCNNmodelsaccuracyproliferationmultiplemodalitiesemergedcriticalchallengelandscapenecessitatingadvancedmethodsstudyproposescomprehensiveintegratingimagesvideosdeepemploysdual-phasedmethodologyfirstanalyzingvariousclassifiersdevelopingcombininganalysismodifiedvisualExperimentsISOTdatasetMediaEval2016imageverificationcorpusdemonstrateeffectivenessproposedRandomForestclassifierachieved99%outperformingalgorithmsshowedsuperiorperformancecomparedbaseline31%improvementexistingcontributesongoingeffortscombatmisinformationprovidingrobustadaptabledetectingdifferentmediaformatsaddressingcomplexitiesdisseminationmanipulationEnsembleTechniquesRobustFakeNewsDetection:IntegratingTransformersNaturalLanguageProcessingMachineLearningNLPmulti-modaltransformers

Similar Articles

Cited By (1)