Speech Signal and Facial Image Processing for Obstructive Sleep Apnea Assessment.

Fernando Espinoza-Cuadros, Rubén Fernández-Pozo, Doroteo T Toledano, José D Alcázar-Ramírez, Eduardo López-Gonzalo, Luis A Hernández-Gómez
Author Information
  1. Fernando Espinoza-Cuadros: GAPS Signal Processing Applications Group, Universidad Politécnica de Madrid, 28040 Madrid, Spain.
  2. Rubén Fernández-Pozo: GAPS Signal Processing Applications Group, Universidad Politécnica de Madrid, 28040 Madrid, Spain.
  3. Doroteo T Toledano: ATVS Biometric Recognition Group, Universidad Autónoma de Madrid, Madrid, Spain.
  4. José D Alcázar-Ramírez: Respiratory Department, Sleep Unit, Hospital Quirón, Málaga, Spain.
  5. Eduardo López-Gonzalo: GAPS Signal Processing Applications Group, Universidad Politécnica de Madrid, 28040 Madrid, Spain.
  6. Luis A Hernández-Gómez: GAPS Signal Processing Applications Group, Universidad Politécnica de Madrid, 28040 Madrid, Spain.

Abstract

Obstructive sleep apnea (OSA) is a common sleep disorder characterized by recurring breathing pauses during sleep caused by a blockage of the upper airway (UA). OSA is generally diagnosed through a costly procedure requiring an overnight stay of the patient at the hospital. This has led to proposing less costly procedures based on the analysis of patients' facial images and voice recordings to help in OSA detection and severity assessment. In this paper we investigate the use of both image and speech processing to estimate the apnea-hypopnea index, AHI (which describes the severity of the condition), over a population of 285 male Spanish subjects suspected to suffer from OSA and referred to a Sleep Disorders Unit. Photographs and voice recordings were collected in a supervised but not highly controlled way trying to test a scenario close to an OSA assessment application running on a mobile device (i.e., smartphones or tablets). Spectral information in speech utterances is modeled by a state-of-the-art low-dimensional acoustic representation, called i-vector. A set of local craniofacial features related to OSA are extracted from images after detecting facial landmarks using Active Appearance Models (AAMs). Support vector regression (SVR) is applied on facial features and i-vectors to estimate the AHI.

References

  1. Chest. 1989 Sep;96(3):589-95 [PMID: 2766817]
  2. Sleep. 2010 Sep;33(9):1249-54 [PMID: 20857873]
  3. Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:4232-5 [PMID: 25570926]
  4. Respir Care. 1995 Dec;40(12):1336-43 [PMID: 10153260]
  5. Am J Orthod Dentofacial Orthop. 1995 Jun;107(6):589-95 [PMID: 7771363]
  6. Am J Respir Crit Care Med. 2003 Sep 1;168(5):522-30 [PMID: 12746251]
  7. J Laryngol Otol. 1989 Mar;103(3):287-92 [PMID: 2703770]
  8. Sleep Med. 2005 Nov;6(6):497-505 [PMID: 15994120]
  9. Sleep. 2006 Jul;29(7):903-8 [PMID: 16895257]
  10. Sleep. 2009 Jan;32(1):46-52 [PMID: 19189778]
  11. J Acoust Soc Am. 1989 Apr;85(4):1699-707 [PMID: 2708686]
  12. IEEE Trans Pattern Anal Mach Intell. 2014 Dec;36(12):2483-509 [PMID: 26353153]
  13. Chest. 1988 Jan;93(1):104-9 [PMID: 3335138]
  14. Chest. 1987 Oct;92(4):670-5 [PMID: 3308347]
  15. Chest. 1993 Oct;104(4):1093-6 [PMID: 8404173]
  16. Am J Respir Crit Care Med. 1998 Jan;157(1):280-3 [PMID: 9445310]
  17. Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:4224-7 [PMID: 25570924]
  18. J Clin Sleep Med. 2010 Dec 15;6(6):545-9 [PMID: 21206744]
  19. Biomed Eng Online. 2016 Feb 20;15:20 [PMID: 26897500]
  20. J Voice. 2016 Jan;30(1):21-9 [PMID: 25795368]
  21. IEEE Trans Biomed Eng. 2011 May;58(5):1373-82 [PMID: 21172747]
  22. J Clin Sleep Med. 2013 Sep 15;9(9):845-52 [PMID: 23997695]
  23. Acta Otolaryngol. 1997 Sep;117(5):760-3 [PMID: 9349877]
  24. Indian J Med Res. 2010 Feb;131:165-70 [PMID: 20308741]
  25. Eur Respir J. 2011 Aug;38(2):348-58 [PMID: 21233264]
  26. Sleep. 2009 Jan;32(1):37-45 [PMID: 19189777]

MeSH Term

Adult
Aged
Aged, 80 and over
Computational Biology
Face
Humans
Image Interpretation, Computer-Assisted
Male
Middle Aged
Phonation
Photography
Sleep Apnea, Obstructive
Speech Acoustics
Speech Articulation Tests
Young Adult

Word Cloud

Created with Highcharts 10.0.0OSAsleepfacialObstructivecostlyimagesvoicerecordingsseverityassessmentspeechestimateAHISleepfeaturesapneacommondisordercharacterizedrecurringbreathingpausescausedblockageupperairwayUAgenerallydiagnosedprocedurerequiringovernightstaypatienthospitalledproposinglessproceduresbasedanalysispatients'helpdetectionpaperinvestigateuseimageprocessingapnea-hypopneaindexdescribesconditionpopulation285maleSpanishsubjectssuspectedsufferreferredDisordersUnitPhotographscollectedsupervisedhighlycontrolledwaytryingtestscenariocloseapplicationrunningmobiledeviceiesmartphonestabletsSpectralinformationutterancesmodeledstate-of-the-artlow-dimensionalacousticrepresentationcalledi-vectorsetlocalcraniofacialrelatedextracteddetectinglandmarksusingActiveAppearanceModelsAAMsSupportvectorregressionSVRappliedi-vectorsSpeechSignalFacialImageProcessingApneaAssessment

Similar Articles

Cited By