The semi-automation of title and abstract screening: a retrospective exploration of ways to leverage Abstrackr's relevance predictions in systematic and rapid reviews.

Allison Gates, Michelle Gates, Meghan Sebastianski, Samantha Guitard, Sarah A Elliott, Lisa Hartling
Author Information
  1. Allison Gates: Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada. agates1@ualberta.ca. ORCID
  2. Michelle Gates: Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada.
  3. Meghan Sebastianski: Alberta Strategy for Patient-Oriented Research (SPOR) SUPPORT Unit Knowledge Translation Platform, University of Alberta, Edmonton, Alberta, Canada.
  4. Samantha Guitard: Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada.
  5. Sarah A Elliott: Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada.
  6. Lisa Hartling: Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada.

Abstract

BACKGROUND: We investigated the feasibility of using a machine learning tool's relevance predictions to expedite title and abstract screening.
METHODS: We subjected 11 systematic reviews and six rapid reviews to four retrospective screening simulations (automated and semi-automated approaches to single-reviewer and dual independent screening) in Abstrackr, a freely-available machine learning software. We calculated the proportion missed, workload savings, and time savings compared to single-reviewer and dual independent screening by human reviewers. We performed cited reference searches to determine if missed studies would be identified via reference list scanning.
RESULTS: For systematic reviews, the semi-automated, dual independent screening approach provided the best balance of time savings (median (range) 20 (3-82) hours) and reliability (median (range) proportion missed records, 1 (0-14)%). The cited references search identified 59% (n = 10/17) of the records missed. For the rapid reviews, the fully and semi-automated approaches saved time (median (range) 9 (2-18) hours and 3 (1-10) hours, respectively), but less so than for the systematic reviews. The median (range) proportion missed records for both approaches was 6 (0-22)%.
CONCLUSION: Using Abstrackr to assist one of two reviewers in systematic reviews saves time with little risk of missing relevant records. Many missed records would be identified via other means.

Keywords

References

  1. Syst Rev. 2022 Dec 1;11(1):258 [PMID: 36457048]
  2. Syst Rev. 2015 Jan 14;4:5 [PMID: 25588314]
  3. Syst Rev. 2019 Jun 18;8(1):143 [PMID: 31215463]
  4. Syst Rev. 2017 Feb 17;6(1):32 [PMID: 28212677]
  5. BMJ. 2017 Sep 21;358:j4008 [PMID: 28935701]
  6. Syst Rev. 2015 Jun 15;4:80 [PMID: 26073974]
  7. Syst Rev. 2018 Mar 12;7(1):45 [PMID: 29530097]
  8. Syst Rev. 2019 Nov 15;8(1):278 [PMID: 31727150]
  9. Res Synth Methods. 2011 Mar;2(1):1-14 [PMID: 26061596]
  10. BMC Med Res Methodol. 2019 Jun 28;19(1):132 [PMID: 31253092]
  11. BMC Med. 2015 Sep 16;13:224 [PMID: 26377409]
  12. BMC Bioinformatics. 2010 Jan 26;11:55 [PMID: 20102628]
  13. BMJ Open. 2017 Feb 27;7(2):e012545 [PMID: 28242767]
  14. Inj Prev. 2020 Feb;26(1):55-60 [PMID: 31451565]

Grants

  1. n/a/Alberta Innovates

MeSH Term

Automation
Humans
Machine Learning
Reproducibility of Results
Retrospective Studies
Systematic Reviews as Topic

Word Cloud

Created with Highcharts 10.0.0reviewsmissedscreeningsystematicrecordstimemedianrangelearningrapidsemi-automatedapproachesdualindependentproportionsavingsidentifiedhoursmachinerelevancepredictionstitleabstractretrospectivesingle-reviewerAbstrackrreviewerscitedreferencevia%BACKGROUND:investigatedfeasibilityusingtool'sexpediteMETHODS:subjected11sixfoursimulationsautomatedfreely-availablesoftwarecalculatedworkloadcomparedhumanperformedsearchesdeterminestudieslistscanningRESULTS:approachprovidedbestbalance203-82reliability10-14referencessearch59%n = 10/17fullysaved92-1831-10respectivelyless60-22CONCLUSION:UsingassistonetwosaveslittleriskmissingrelevantManymeanssemi-automationscreening:explorationwaysleverageAbstrackr'sAutomationEfficiencyMachineRapidSystematic

Similar Articles

Cited By