Construction of an Assisted Model Based on Natural Language Processing for Automatic Early Diagnosis of Autoimmune Encephalitis.

Yunsong Zhao, Bin Ren, Wenjin Yu, Haijun Zhang, Di Zhao, Junchao Lv, Zhen Xie, Kun Jiang, Lei Shang, Han Yao, Yongyong Xu, Gang Zhao
Author Information
  1. Yunsong Zhao: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  2. Bin Ren: Department of Information, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  3. Wenjin Yu: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  4. Haijun Zhang: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  5. Di Zhao: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  6. Junchao Lv: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  7. Zhen Xie: College of Life Sciences and Medicine, Northwest University, Xi'an, China.
  8. Kun Jiang: Department of Information, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
  9. Lei Shang: Department of Health Statistics, Fourth Military Medical University, Xi'an, China.
  10. Han Yao: Department of Neurobiology, School of Basic Medicine, Fourth Military Medical University, Xi'an, China.
  11. Yongyong Xu: College of Life Sciences and Medicine, Northwest University, Xi'an, China. 20182028@nwu.edu.cn.
  12. Gang Zhao: Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an, China. zhaogang@nwu.edu.cn.

Abstract

INTRODUCTION: Early diagnosis and etiological treatment can effectively improve the prognosis of patients with autoimmune encephalitis (AE). However, anti-neuronal antibody tests which provide the definitive diagnosis require time and are not always abnormal. By using natural language processing (NLP) technology, our study proposes an assisted diagnostic method for early clinical diagnosis of AE and compares its sensitivity with that of previously established criteria.
METHODS: Our model is based on the text classification model trained by the history of present illness (HPI) in electronic medical records (EMRs) that present a definite pathological diagnosis of AE or infectious encephalitis (IE). The definitive diagnosis of IE was based on the results of traditional etiological examinations. The definitive diagnosis of AE was based on the results of neuronal antibodies, and the diagnostic criteria of definite autoimmune limbic encephalitis proposed by Graus et al. used as the reference standard for antibody-negative AE. First, we automatically recognized and extracted symptoms for all HPI texts in EMRs by training a dataset of 552 cases. Second, four text classification models trained by a dataset of 199 cases were established for differential diagnosis of AE and IE based on a post-structuring text dataset of every HPI, which was completed using symptoms in English language after the process of normalization of synonyms. The optimal model was identified by evaluating and comparing the performance of the four models. Finally, combined with three typical symptoms and the results of standard paraclinical tests such as cerebrospinal fluid (CSF), magnetic resonance imaging (MRI), or electroencephalogram (EEG) proposed from Graus criteria, an assisted early diagnostic model for AE was established on the basis of the text classification model with the best performance.
RESULTS: The comparison results for the four models applied to the independent testing dataset showed the naïve Bayesian classifier with bag of words achieved the best performance, with an area under the receiver operating characteristic curve of 0.85, accuracy of 84.5% (95% confidence interval [CI] 74.0-92.0%), sensitivity of 86.7% (95% CI 69.3-96.2%), and specificity of 82.9% (95% CI 67.9-92.8%), respectively. Compared with the diagnostic criteria proposed previously, the early diagnostic sensitivity for possible AE using the assisted diagnostic model based on the independent testing dataset was improved from 73.3% (95% CI 54.1-87.7%) to 86.7% (95% CI 69.3-96.2%).
CONCLUSIONS: The assisted diagnostic model could effectively increase the early diagnostic sensitivity for AE compared to previous diagnostic criteria, assist physicians in establishing the diagnosis of AE automatically after inputting the HPI and the results of standard paraclinical tests according to their narrative habits for describing symptoms, avoiding misdiagnosis and allowing for prompt initiation of specific treatment.

Keywords

References

  1. Lancet Neurol. 2013 Feb;12(2):157-65 [PMID: 23290630]
  2. Neuropsychol Rehabil. 2007 Aug-Oct;17(4-5):478-505 [PMID: 17676531]
  3. J Epilepsy Res. 2016 Dec 31;6(2):45-50 [PMID: 28101474]
  4. J Am Med Inform Assoc. 2014 Sep-Oct;21(5):808-14 [PMID: 24347408]
  5. Neurol Sci. 2017 Oct;38(Suppl 2):225-229 [PMID: 29030767]
  6. Eur J Neurol. 2016 Mar;23(3):621-9 [PMID: 26563553]
  7. Ann Neurol. 2014 Feb;75(2):317-23 [PMID: 24318406]
  8. J Neurol. 2019 Jan;266(1):124-132 [PMID: 30421340]
  9. J Biomed Inform. 2016 Apr;60:334-41 [PMID: 26923634]
  10. Int J Med Inform. 2020 Feb;134:103974 [PMID: 31630961]
  11. J Biomed Inform. 2020 May;105:103419 [PMID: 32298847]
  12. Clin Microbiol Infect. 2009 Jun;15(6):560-4 [PMID: 19392906]
  13. JAMA Neurol. 2021 Nov 1;78(11):1333-1344 [PMID: 34542573]
  14. J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6 [PMID: 21685143]
  15. Comput Methods Programs Biomed. 2018 Oct;164:75-85 [PMID: 30195433]
  16. BMC Neurol. 2017 Nov 6;17(1):195 [PMID: 29110638]
  17. BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):74 [PMID: 30943972]
  18. J Neurol. 2020 Feb;267(2):449-460 [PMID: 31667626]
  19. NPJ Digit Med. 2020 Mar 26;3:47 [PMID: 32258429]
  20. Neurol Sci. 2017 Oct;38(Suppl 2):237-242 [PMID: 29030766]
  21. Lancet Neurol. 2018 Sep;17(9):760-772 [PMID: 30049614]
  22. BMC Med Inform Decis Mak. 2020 Nov 30;20(1):310 [PMID: 33256715]
  23. Med Clin (Barc). 2021 Mar 26;156(6):302-304 [PMID: 32183986]
  24. Neurol Sci. 2015 Dec;36(12):2191-7 [PMID: 26205533]
  25. Bioinformatics. 2020 Feb 15;36(4):1234-1240 [PMID: 31501885]
  26. Int J Neurosci. 2017 Feb;127(2):109-117 [PMID: 26887329]
  27. Front Neurol. 2018 Jun 08;9:434 [PMID: 29951031]
  28. Sci Rep. 2021 Dec 8;11(1):23705 [PMID: 34880354]
  29. Lancet Neurol. 2016 Apr;15(4):391-404 [PMID: 26906964]
  30. J Neurosci. 2013 Nov 13;33(46):18161-74 [PMID: 24227725]
  31. J Clin Epidemiol. 2022 Feb;142:252-257 [PMID: 34748907]
  32. Emerg Med Clin North Am. 2016 Nov;34(4):917-942 [PMID: 27741995]
  33. Annu Rev Biomed Data Sci. 2021 Jul 20;4:165-187 [PMID: 34465177]
  34. Neurol Sci. 2017 Apr;38(4):703-706 [PMID: 27778112]
  35. Mult Scler Relat Disord. 2021 Jan;47:102604 [PMID: 33130468]
  36. Neuroimaging Clin N Am. 2012 Nov;22(4):791-9 [PMID: 23122267]
  37. Lancet Infect Dis. 2010 Dec;10(12):835-44 [PMID: 20952256]

Grants

  1. 81671185/National Natural Science Foundation of China

Word Cloud

Created with Highcharts 10.0.0diagnosisAEdiagnosticmodelcriteriabasedresultsdatasetencephalitisassistedearlysensitivitytextHPIsymptoms95% CIEarlytestsdefinitiveusinglanguageestablishedclassificationIEproposedstandardfourmodelsperformance7%etiologicaltreatmenteffectivelyautoimmuneprocessingpreviouslytrainedpresentmedicalrecordsEMRsdefiniteGrausautomaticallycasesparaclinicalbestindependenttesting86693-962%NaturalAutoimmuneINTRODUCTION:canimproveprognosispatientsHoweveranti-neuronalantibodyproviderequiretimealwaysabnormalnaturalNLPtechnologystudyproposesmethodclinicalcomparesMETHODS:historyillnesselectronicpathologicalinfectioustraditionalexaminationsneuronalantibodieslimbicetalusedreferenceantibody-negativeFirstrecognizedextractedtextstraining552Second199differentialpost-structuringeverycompletedEnglishprocessnormalizationsynonymsoptimalidentifiedevaluatingcomparingFinallycombinedthreetypicalcerebrospinalfluidCSFmagneticresonanceimagingMRIelectroencephalogramEEGbasisRESULTS:comparisonappliedshowednaïveBayesianclassifierbagwordsachievedareareceiveroperatingcharacteristiccurve085accuracy845%95%confidenceinterval[CI]740-920%specificity829%679-928%respectivelyComparedpossibleimproved733%541-87CONCLUSIONS:increasecomparedpreviousassistphysiciansestablishinginputtingaccordingnarrativehabitsdescribingavoidingmisdiagnosisallowingpromptinitiationspecificConstructionAssistedModelBasedLanguageProcessingAutomaticDiagnosisEncephalitisComputer-assistedElectronic

Similar Articles

Cited By