Evaluation of publication type tagging as a strategy to screen randomized controlled trial articles in preparing systematic reviews.

Jodi Schneider, Linh Hoang, Yogeshwar Kansara, Aaron M Cohen, Neil R Smalheiser
Author Information
  1. Jodi Schneider: School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois, USA. ORCID
  2. Linh Hoang: School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois, USA. ORCID
  3. Yogeshwar Kansara: School of Information Sciences, University of Illinois Urbana-Champaign, Champaign, Illinois, USA. ORCID
  4. Aaron M Cohen: Department of Medical Informatics and Clinical Epidemiology (DMICE), School of Medicine, Oregon Health & Science University, Portland, Oregon, USA. ORCID
  5. Neil R Smalheiser: Department of Psychiatry, College of Medicine, University of Illinois Chicago, Chicago, Illinois, USA. ORCID

Abstract

Objectives: To produce a systematic review (SR), reviewers typically screen thousands of titles and abstracts of articles manually to find a small number which are read in full text to find relevant articles included in the final SR. Here, we evaluate a proposed automated probabilistic publication type screening strategy applied to the randomized controlled trial (RCT) articles (i.e., those which present clinical outcome results of RCT studies) included in a corpus of previously published Cochrane reviews.
Materials and Methods: We selected a random subset of 558 published Cochrane reviews that specified RCT study only inclusion criteria, containing 7113 included articles which could be matched to PubMed identifiers. These were processed by our automated RCT Tagger tool to estimate the probability that each article reports clinical outcomes of a RCT.
Results: Removing articles with low predictive scores  < 0.01 eliminated 288 included articles, of which only 22 were actually typical RCT articles, and only 18 were actually typical RCT articles that MEDLINE indexed as such. Based on our sample set, this screening strategy led to fewer than 0.05 relevant RCT articles being missed on average per Cochrane SR.
Discussion: This scenario, based on real SRs, demonstrates that automated tagging can identify RCT articles accurately while maintaining very high recall. However, we also found that even SRs whose inclusion criteria are restricted to RCT studies include not only clinical outcome articles per se, but a variety of ancillary article types as well.
Conclusions: This encourages further studies learning how best to incorporate automated tagging of additional publication types into SR triage workflows.

Keywords

References

  1. Res Synth Methods. 2018 Dec;9(4):602-614 [PMID: 29314757]
  2. J Clin Epidemiol. 2020 May;121:81-90 [PMID: 32004673]
  3. Health Info Libr J. 2009 Sep;26(3):187-202 [PMID: 19712211]
  4. Syst Rev. 2014 Jul 09;3:74 [PMID: 25005128]
  5. Syst Rev. 2020 Apr 2;9(1):73 [PMID: 32241297]
  6. Syst Rev. 2016 May 23;5:87 [PMID: 27216467]
  7. BMJ. 2009 Jul 21;339:b2535 [PMID: 19622551]
  8. Syst Rev. 2019 Nov 15;8(1):278 [PMID: 31727150]
  9. J Family Med Prim Care. 2013 Jan;2(1):9-14 [PMID: 24479036]
  10. J Am Med Inform Assoc. 2015 May;22(3):707-17 [PMID: 25656516]
  11. Syst Rev. 2018 Apr 25;7(1):64 [PMID: 29695296]
  12. J Am Med Inform Assoc. 2017 Nov 01;24(6):1165-1168 [PMID: 28541493]
  13. BMC Med Res Methodol. 2020 Jun 3;20(1):139 [PMID: 32493228]
  14. Syst Rev. 2013 Sep 25;2:78 [PMID: 24066664]
  15. BMC Med Res Methodol. 2020 Oct 15;20(1):256 [PMID: 33059590]
  16. N Engl J Med. 2016 Jun 2;374(22):2175-81 [PMID: 27248626]
  17. Syst Rev. 2019 Nov 15;8(1):277 [PMID: 31727159]
  18. Syst Rev. 2020 May 4;9(1):100 [PMID: 32366302]

Grants

  1. R01 LM010817/NLM NIH HHS

Word Cloud

Created with Highcharts 10.0.0articlesRCTSRincludedautomatedsystematicpublicationstrategyrandomizedcontrolledclinicalstudiesCochranereviewstaggingreviewscreenfindrelevanttypescreeningtrialoutcomepublishedinclusioncriteriaTaggerarticleactuallytypicalperSRstypesObjectives:producereviewerstypicallythousandstitlesabstractsmanuallysmallnumberreadfulltextfinalevaluateproposedprobabilisticappliediepresentresultscorpuspreviouslyMaterialsMethods:selectedrandomsubset558specifiedstudycontaining7113matchedPubMedidentifiersprocessedtoolestimateprobabilityreportsoutcomesResults:Removinglowpredictivescores< 001eliminated2882218MEDLINEindexedBasedsamplesetledfewer005missedaverageDiscussion:scenariobasedrealdemonstratescanidentifyaccuratelymaintaininghighrecallHoweveralsofoundevenwhoserestrictedincludesevarietyancillarywellConclusions:encourageslearningbestincorporateadditionaltriageworkflowsEvaluationpreparinginformationretrievaltrialsautomation

Similar Articles

Cited By (1)