COVID19-CT-dataset: an open-access chest CT image repository of 1000+ patients with confirmed COVID-19 diagnosis.

Shokouh Shakouri, Mohammad Amin Bakhshali, Parvaneh Layegh, Behzad Kiani, Farid Masoumi, Saeedeh Ataei Nakhaei, Sayyed Mostafa Mostafavi
Author Information
  1. Shokouh Shakouri: Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
  2. Mohammad Amin Bakhshali: Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
  3. Parvaneh Layegh: Department of Radiology, Faculty of Medicine, Imam Reza Hospital, Mashhad University of Medical Sciences, Mashhad, Iran.
  4. Behzad Kiani: Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran. ORCID
  5. Farid Masoumi: Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
  6. Saeedeh Ataei Nakhaei: Nuclear Medicine Research Center, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
  7. Sayyed Mostafa Mostafavi: Department of Medical Informatics, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran. MostafaviTM@mums.ac.ir. ORCID

Abstract

OBJECTIVES: The ongoing Coronavirus disease 2019 (COVID-19) pandemic has drastically impacted the global health and economy. Computed tomography (CT) is the prime imaging modality for diagnosis of lung infections in COVID-19 patients. Data-driven and Artificial intelligence (AI)-powered solutions for automatic processing of CT images predominantly rely on large-scale, heterogeneous datasets. Owing to privacy and data availability issues, open-access and publicly available COVID-19 CT datasets are difficult to obtain, thus limiting the development of AI-enabled automatic diagnostic solutions. To tackle this problem, large CT image datasets encompassing diverse patterns of lung infections are in high demand.
DATA DESCRIPTION: In the present study, we provide an open-source repository containing 1000+ CT images of COVID-19 lung infections established by a team of board-certified radiologists. CT images were acquired from two main general university hospitals in Mashhad, Iran from March 2020 until January 2021. COVID-19 infections were ratified with matching tests including Reverse transcription polymerase chain reaction (RT-PCR) and accompanying clinical symptoms. All data are 16-bit grayscale images composed of 512 × 512 pixels and are stored in DICOM standard. Patient privacy is preserved by removing all patient-specific information from image headers. Subsequently, all images corresponding to each patient are compressed and stored in RAR format.

Keywords

References

  1. BMC Res Notes. 2021 May 12;14(1):178 [PMID: 33980279]
  2. Geospat Health. 2020 Nov 26;15(2): [PMID: 33461262]
  3. IEEE Rev Biomed Eng. 2021;14:4-15 [PMID: 32305937]
  4. Radiology. 2008 Mar;246(3):697-722 [PMID: 18195376]
  5. IEEE Access. 2021 Feb 10;9:30551-30572 [PMID: 34976571]
  6. Sci Data. 2021 Apr 29;8(1):121 [PMID: 33927208]

Grants

  1. 991315/Mashhad University of Medical Sciences

MeSH Term

Artificial Intelligence
COVID-19
COVID-19 Testing
Humans
Iran
Lung
SARS-CoV-2
Tomography, X-Ray Computed

Word Cloud

Created with Highcharts 10.0.0COVID-19CTimagesinfectionsimagelungdatasetsCoronavirusComputedtomographyimagingdiagnosisArtificialintelligencesolutionsautomaticprivacydataopen-accessrepositorystoredOBJECTIVES:ongoingdisease2019pandemicdrasticallyimpactedglobalhealtheconomyprimemodalitypatientsData-drivenAI-poweredprocessingpredominantlyrelylarge-scaleheterogeneousOwingavailabilityissuespubliclyavailabledifficultobtainthuslimitingdevelopmentAI-enableddiagnostictackleproblemlargeencompassingdiversepatternshighdemandDATADESCRIPTION:presentstudyprovideopen-sourcecontaining1000+ CTestablishedteamboard-certifiedradiologistsacquiredtwomaingeneraluniversityhospitalsMashhadIranMarch2020January2021ratifiedmatchingtestsincludingReversetranscriptionpolymerasechainreactionRT-PCRaccompanyingclinicalsymptoms16-bitgrayscalecomposed512 × 512pixelsDICOMstandardPatientpreservedremovingpatient-specificinformationheadersSubsequentlycorrespondingpatientcompressedRARformatCOVID19-CT-dataset:chest1000+ patientsconfirmedChestClinicalDeeplearningDiagnosisLunginfectionRadiology

Similar Articles

Cited By