Linguistic traces of a scientific fraud: the case of Diederik Stapel.

David M Markowitz, Jeffrey T Hancock
Author Information
  1. David M Markowitz: Department of Communication, Cornell University, Ithaca, New York, United States of America.
  2. Jeffrey T Hancock: Department of Communication, Cornell University, Ithaca, New York, United States of America; Department of Information Science, Cornell University, Ithaca, New York, United States of America.

Abstract

When scientists report false data, does their writing style reflect their deception? In this study, we investigated the linguistic patterns of fraudulent (N  =  24; 170,008 words) and genuine publications (N  =  25; 189,705 words) first-authored by social psychologist Diederik Stapel. The analysis revealed that Stapel's fraudulent papers contained linguistic changes in science-related discourse dimensions, including more terms pertaining to methods, investigation, and certainty than his genuine papers. His writing style also matched patterns in other deceptive language, including fewer adjectives in fraudulent publications relative to genuine publications. Using differences in language dimensions we were able to classify Stapel's publications with above chance accuracy. Beyond these discourse dimensions, Stapel included fewer co-authors when reporting fake data than genuine data, although other evidentiary claims (e.g., number of references and experiments) did not differ across the two article types. This research supports recent findings that language cues vary systematically with deception, and that deception can be revealed in fraudulent scientific discourse.

References

  1. Psychol Sci. 2013 Oct;24(10):1875-88 [PMID: 23982243]
  2. PLoS One. 2012;7(10):e44118 [PMID: 23115617]
  3. Psychol Sci. 2011 Nov;22(11):1359-66 [PMID: 22006061]
  4. Proc Natl Acad Sci U S A. 2012 Oct 16;109(42):17028-33 [PMID: 23027971]
  5. Pers Soc Psychol Bull. 2003 May;29(5):665-75 [PMID: 15272998]
  6. PLoS One. 2009 May 29;4(5):e5738 [PMID: 19478950]

MeSH Term

Authorship
Biomedical Research
Deception
Fraud
Humans
Linguistics
Research Report
Science
Scientific Misconduct
Writing

Word Cloud

Created with Highcharts 10.0.0fraudulentgenuinepublicationsdataStapeldiscoursedimensionslanguagewritingstylelinguisticpatternsN = wordsDiederikrevealedStapel'spapersincludingfewerdeceptionscientificscientistsreportfalsereflectdeception?studyinvestigated2417000825189705first-authoredsocialpsychologistanalysiscontainedchangesscience-relatedtermspertainingmethodsinvestigationcertaintyalsomatcheddeceptiveadjectivesrelativeUsingdifferencesableclassifychanceaccuracyBeyondincludedco-authorsreportingfakealthoughevidentiaryclaimsegnumberreferencesexperimentsdifferacrosstwoarticletypesresearchsupportsrecentfindingscuesvarysystematicallycanLinguistictracesfraud:case

Similar Articles

Cited By