Semantic relations for problem-oriented medical records.

Ozlem Uzuner, Jonathan Mailoa, Russell Ryan, Tawanda Sibanda
Author Information
  1. Ozlem Uzuner: University at Albany, State University of New York, 135 Western Ave., Draper 114A, Albany, NY 12222, USA. ouzuner@albany.edu

Abstract

OBJECTIVE: We describe semantic relation (SR) classification on medical discharge summaries. We focus on relations targeted to the creation of problem-oriented records. Thus, we define relations that involve the medical problems of patients.
METHODS AND MATERIALS: We represent patients' medical problems with their diseases and symptoms. We study the relations of patients' problems with each other and with concepts that are identified as tests and treatments. We present an SR classifier that studies a corpus of patient records one sentence at a time. For all pairs of concepts that appear in a sentence, this SR classifier determines the relations between them. In doing so, the SR classifier takes advantage of surface, lexical, and syntactic features and uses these features as input to a support vector machine. We apply our SR classifier to two sets of medical discharge summaries, one obtained from the Beth Israel-Deaconess Medical Center (BIDMC), Boston, MA and the other from Partners Healthcare, Boston, MA.
RESULTS: On the BIDMC corpus, our SR classifier achieves micro-averaged F-measures that range from 74% to 95% on the various relation types. On the Partners corpus, the micro-averaged F-measures on the various relation types range from 68% to 91%. Our experiments show that lexical features (in particular, tokens that occur between candidate concepts, which we refer to as inter-concept tokens) are very informative for relation classification in medical discharge summaries. Using only the inter-concept tokens in the corpus, our SR classifier can recognize 84% of the relations in the BIDMC corpus and 72% of the relations in the Partners corpus.
CONCLUSION: These results are promising for semantic indexing of medical records. They imply that we can take advantage of lexical patterns in discharge summaries for relation classification at a sentence level.

References

  1. Proc AMIA Symp. 2000;:704-8 [PMID: 11079975]
  2. Stud Health Technol Inform. 2004;107(Pt 2):758-62 [PMID: 15360914]
  3. Bioinformatics. 2001;17 Suppl 1:S74-82 [PMID: 11472995]
  4. Bioinformatics. 2007 Feb 1;23(3):365-71 [PMID: 17142812]
  5. Proc AMIA Symp. 2002;:722-6 [PMID: 12463919]
  6. Stud Health Technol Inform. 2005;116:805-10 [PMID: 16160357]
  7. AMIA Annu Symp Proc. 2003;:639-43 [PMID: 14728251]
  8. N Engl J Med. 1968 Mar 21;278(12):652-7 concl [PMID: 5637250]
  9. BMC Bioinformatics. 2008 Nov 19;9 Suppl 11:S3 [PMID: 19025689]
  10. BMC Med Inform Decis Mak. 2005 Aug 31;5:30 [PMID: 16135244]
  11. J Am Med Inform Assoc. 1994 Mar-Apr;1(2):161-74 [PMID: 7719797]
  12. AMIA Annu Symp Proc. 2006;:714-8 [PMID: 17238434]
  13. J Am Med Inform Assoc. 2005 May-Jun;12(3):296-8 [PMID: 15684123]
  14. BMC Bioinformatics. 2008 Apr 23;9:207 [PMID: 18433469]
  15. BMC Bioinformatics. 2010 Feb 23;11:101 [PMID: 20178611]
  16. J Am Med Inform Assoc. 2009 Jan-Feb;16(1):109-15 [PMID: 18952931]

Grants

  1. U54 LM008748/NLM NIH HHS
  2. U54LM008748/NLM NIH HHS

MeSH Term

Artificial Intelligence
Humans
Medical Records Systems, Computerized
Medical Records, Problem-Oriented
Patient Care Planning
Patient Discharge
Semantics

Word Cloud

Created with Highcharts 10.0.0SRmedicalrelationsclassifiercorpusrelationdischargesummariesrecordsclassificationproblemsconceptssentencelexicalfeaturesBIDMCPartnerstokenssemanticproblem-orientedpatients'oneadvantageBostonMAmicro-averagedF-measuresrangevarioustypesinter-conceptcanOBJECTIVE:describefocustargetedcreationThusdefineinvolvepatientsMETHODSANDMATERIALS:representdiseasessymptomsstudyidentifiedteststreatmentspresentstudiespatienttimepairsappeardeterminestakessurfacesyntacticusesinputsupportvectormachineapplytwosetsobtainedBethIsrael-DeaconessMedicalCenterHealthcareRESULTS:achieves74%95%68%91%experimentsshowparticularoccurcandidatereferinformativeUsingrecognize84%72%CONCLUSION:resultspromisingindexingimplytakepatternslevelSemantic

Similar Articles

Cited By