Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases.

Andrew Janowczyk, Anant Madabhushi
Author Information
  1. Andrew Janowczyk: Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH 44106, USA.
  2. Anant Madabhushi: Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH 44106, USA.

Abstract

BACKGROUND: Deep learning (DL) is a representation learning approach ideally suited for image analysis challenges in digital pathology (DP). The variety of image analysis tasks in the context of DP includes detection and counting (e.g., mitotic events), segmentation (e.g., nuclei), and tissue classification (e.g., cancerous vs. non-cancerous). Unfortunately, issues with slide preparation, variations in staining and scanning across sites, and vendor platforms, as well as biological variance, such as the presentation of different grades of disease, make these image analysis tasks particularly challenging. Traditional approaches, wherein domain-specific cues are manually identified and developed into task-specific "handcrafted" features, can require extensive tuning to accommodate these variances. However, DL takes a more domain agnostic approach combining both feature discovery and implementation to maximally discriminate between the classes of interest. While DL approaches have performed well in a few DP related image analysis tasks, such as detection and tissue classification, the currently available open source tools and tutorials do not provide guidance on challenges such as (a) selecting appropriate magnification, (b) managing errors in annotations in the training (or learning) dataset, and (c) identifying a suitable training set containing information rich exemplars. These foundational concepts, which are needed to successfully translate the DL paradigm to DP tasks, are non-trivial for (i) DL experts with minimal digital histology experience, and (ii) DP and image processing experts with minimal DL experience, to derive on their own, thus meriting a dedicated tutorial.
AIMS: This paper investigates these concepts through seven unique DP tasks as use cases to elucidate techniques needed to produce comparable, and in many cases, superior to results from the state-of-the-art hand-crafted feature-based classification approaches.
RESULTS: Specifically, in this tutorial on DL for DP image analysis, we show how an open source framework (Caffe), with a singular network architecture, can be used to address: (a) nuclei segmentation (F-score of 0.83 across 12,000 nuclei), (b) epithelium segmentation (F-score of 0.84 across 1735 regions), (c) tubule segmentation (F-score of 0.83 from 795 tubules), (d) lymphocyte detection (F-score of 0.90 across 3064 lymphocytes), (e) mitosis detection (F-score of 0.53 across 550 mitotic events), (f) invasive ductal carcinoma detection (F-score of 0.7648 on 50 k testing patches), and (g) lymphoma classification (classification accuracy of 0.97 across 374 images).
CONCLUSION: This paper represents the largest comprehensive study of DL approaches in DP to date, with over 1200 DP images used during evaluation. The supplemental online material that accompanies this paper consists of step-by-step instructions for the usage of the supplied source code, trained models, and input data.

Keywords

References

  1. IEEE Trans Biomed Eng. 2014 May;61(5):1400-11 [PMID: 24759275]
  2. IEEE Rev Biomed Eng. 2014;7:97-114 [PMID: 24802905]
  3. Med Image Comput Comput Assist Interv. 2013;16(Pt 2):411-8 [PMID: 24579167]
  4. Comput Methods Biomech Biomed Eng Imaging Vis. 2018;6(3):270-276 [PMID: 29732269]
  5. J Med Imaging (Bellingham). 2014 Oct;1(3):034003 [PMID: 26158062]
  6. IEEE Trans Biomed Eng. 2012 May;59(5):1205-18 [PMID: 20570758]
  7. IEEE Trans Med Imaging. 2016 May;35(5):1196-1206 [PMID: 26863654]
  8. J Pathol Inform. 2016 Jul 26;7:29 [PMID: 27563488]
  9. Sci Transl Med. 2011 Nov 9;3(108):108ra113 [PMID: 22072638]
  10. Anal Quant Cytol Histol. 2001 Aug;23(4):291-9 [PMID: 11531144]
  11. Mod Pathol. 2004 Mar;17(3):292-306 [PMID: 14976540]
  12. Med Image Anal. 2015 Feb;20(1):237-48 [PMID: 25547073]
  13. Neurocomputing (Amst). 2016 May 26;191:214-223 [PMID: 28154470]
  14. Anticancer Res. 1998 Jan-Feb;18(1B):571-6 [PMID: 9568179]
  15. IEEE Rev Biomed Eng. 2009;2:147-71 [PMID: 20671804]
  16. Proc IEEE Int Symp Biomed Imaging. 2015 Apr;2015:1284-1287 [PMID: 28101301]
  17. Nature. 2015 May 28;521(7553):436-44 [PMID: 26017442]
  18. J Pathol Inform. 2013 May 30;4:8 [PMID: 23858383]
  19. IEEE Trans Med Imaging. 2016 Jan;35(1):119-30 [PMID: 26208307]
  20. IEEE Trans Med Imaging. 2015 Nov;34(11):2366-78 [PMID: 25993703]
  21. AMIA Annu Symp Proc. 2015 Nov 05;2015:1899-908 [PMID: 26958289]
  22. PeerJ. 2017 Oct 3;5:e3874 [PMID: 29018612]
  23. Med Image Comput Comput Assist Interv. 2012;15(Pt 1):348-56 [PMID: 23285570]
  24. Annu Rev Biomed Eng. 2016 Jul 11;18:387-412 [PMID: 27420575]
  25. IEEE Trans Biomed Eng. 2010 Jul;57(7):1676-89 [PMID: 20172780]
  26. J Pathol Inform. 2011;2:S1 [PMID: 22811953]
  27. Am J Surg Pathol. 2014 Jan;38(1):128-37 [PMID: 24145650]
  28. Pattern Recognit Lett. 2008 Jan;29(11):1684-1693 [PMID: 18958301]

Grants

  1. R01 CA202752/NCI NIH HHS
  2. R21 CA179327/NCI NIH HHS
  3. R21 CA195152/NCI NIH HHS

Word Cloud

Created with Highcharts 10.0.0DPDLimage0learningdetectionacrossF-scoreanalysistaskssegmentationclassificationdigitalegapproachesnucleisourcetutorialpapercasesDeepapproachchallengespathologymitoticeventstissuewellcanopenbtrainingcconceptsneededexpertsminimalhistologyexperienceuseused83imagescomprehensiveBACKGROUND:representationideallysuitedvarietycontextincludescountingcancerousvsnon-cancerousUnfortunatelyissuesslidepreparationvariationsstainingscanningsitesvendorplatformsbiologicalvariancepresentationdifferentgradesdiseasemakeparticularlychallengingTraditionalwhereindomain-specificcuesmanuallyidentifieddevelopedtask-specific"handcrafted"featuresrequireextensivetuningaccommodatevariancesHowevertakesdomainagnosticcombiningfeaturediscoveryimplementationmaximallydiscriminateclassesinterestperformedrelatedcurrentlyavailabletoolstutorialsprovideguidanceselectingappropriatemagnificationmanagingerrorsannotationsdatasetidentifyingsuitablesetcontaininginformationrichexemplarsfoundationalsuccessfullytranslateparadigmnon-trivialiiprocessingderivethusmeritingdedicatedAIMS:investigatessevenuniqueelucidatetechniquesproducecomparablemanysuperiorresultsstate-of-the-arthand-craftedfeature-basedRESULTS:SpecificallyshowframeworkCaffesingularnetworkarchitectureaddress:12000epithelium841735regionstubule795tubulesdlymphocyte903064lymphocytesmitosis53550finvasiveductalcarcinoma764850ktestingpatcheslymphomaaccuracy97374CONCLUSION:representslargeststudydate1200evaluationsupplementalonlinematerialaccompaniesconsistsstep-by-stepinstructionsusagesuppliedcodetrainedmodelsinputdataanalysis:selectedClassificationdeepmachine

Similar Articles

Cited By