Comprehensive review of publicly available colonoscopic imaging databases for artificial intelligence research: availability, accessibility, and usability.

Britt B S L Houwen, Karlijn J Nass, Jasper L A Vleugels, Paul Fockens, Yark Hazewinkel, Evelien Dekker
Author Information
  1. Britt B S L Houwen: Department of Gastroenterology and Hepatology, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Centres, location Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
  2. Karlijn J Nass: Department of Gastroenterology and Hepatology, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Centres, location Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
  3. Jasper L A Vleugels: Department of Gastroenterology and Hepatology, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Centres, location Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
  4. Paul Fockens: Department of Gastroenterology and Hepatology, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Centres, location Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.
  5. Yark Hazewinkel: Department of Gastroenterology and Hepatology, Radboud University Nijmegen Medical Center, Radboud University of Nijmegen, Nijmegen, the Netherlands.
  6. Evelien Dekker: Department of Gastroenterology and Hepatology, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam University Medical Centres, location Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.

Abstract

BACKGROUND AND AIMS: Publicly available databases containing colonoscopic imaging data are valuable resources for artificial intelligence (AI) research. Currently, little is known regarding the available number and content of these databases. This review aimed to describe the availability, accessibility, and usability of publicly available colonoscopic imaging databases, focusing on polyp detection, polyp characterization, and quality of colonoscopy.
METHODS: A systematic literature search was performed in MEDLINE and Embase to identify AI studies describing publicly available colonoscopic imaging databases published after 2010. Second, a targeted search using Google's Dataset Search, Google Search, GitHub, and Figshare was done to identify databases directly. Databases were included if they contained data about polyp detection, polyp characterization, or quality of colonoscopy. To assess accessibility of databases, the following categories were defined: open access, open access with barriers, and regulated access. To assess the potential usability of the included databases, essential details of each database were extracted using a checklist derived from the Checklist for Artificial Intelligence in Medical Imaging.
RESULTS: We identified 22 databases with open access, 3 databases with open access with barriers, and 15 databases with regulated access. The 22 open access databases contained 19,463 images and 952 videos. Nineteen of these databases focused on polyp detection, localization, and/or segmentation; 6 on polyp characterization, and 3 on quality of colonoscopy. Only half of these databases have been used by other researcher to develop, train, or benchmark their AI system. Although technical details were in general well reported, important details such as polyp and patient demographics and the annotation process were under-reported in almost all databases.
CONCLUSIONS: This review provides greater insight on public availability of colonoscopic imaging databases for AI research. Incomplete reporting of important details limits the ability of researchers to assess the usability of current databases.

MeSH Term

Humans
Artificial Intelligence
Colonic Polyps
Colonoscopes
Colonoscopy
Radiography

Word Cloud

Created with Highcharts 10.0.0databasespolypaccessavailablecolonoscopicimagingopenAIusabilitydetailsreviewavailabilityaccessibilitypubliclydetectioncharacterizationqualitycolonoscopyassessdataartificialintelligenceresearchsearchidentifyusingSearchincludedcontainedbarriersregulated223importantBACKGROUNDANDAIMS:PubliclycontainingvaluableresourcesCurrentlylittleknownregardingnumbercontentaimeddescribefocusingMETHODS:systematicliteratureperformedMEDLINEEmbasestudiesdescribingpublished2010SecondtargetedGoogle'sDatasetGoogleGitHubFigsharedonedirectlyDatabasesfollowingcategoriesdefined:potentialessentialdatabaseextractedchecklistderivedChecklistArtificialIntelligenceMedicalImagingRESULTS:identified1519463images952videosNineteenfocusedlocalizationand/orsegmentation6halfusedresearcherdeveloptrainbenchmarksystemAlthoughtechnicalgeneralwellreportedpatientdemographicsannotationprocessunder-reportedalmostCONCLUSIONS:providesgreaterinsightpublicIncompletereportinglimitsabilityresearcherscurrentComprehensiveresearch:

Similar Articles

Cited By