Learning to localize sounds in a highly reverberant environment: Machine-learning tracking of dolphin whistle-like sounds in a pool.

Sean F Woodward, Diana Reiss, Marcelo O Magnasco
Author Information
  1. Sean F Woodward: Laboratory of Integrative Neuroscience, Center for Studies in Physics and Biology, The Rockefeller University, New York, NY, United States of America. ORCID
  2. Diana Reiss: Department of Psychology, Hunter College, City University of New York, New York, NY, United States of America.
  3. Marcelo O Magnasco: Laboratory of Integrative Neuroscience, Center for Studies in Physics and Biology, The Rockefeller University, New York, NY, United States of America.

Abstract

Tracking the origin of propagating wave signals in an environment with complex reflective surfaces is, in its full generality, a nearly intractable problem which has engendered multiple domain-specific literatures. We posit that, if the environment and sensor geometries are fixed, machine learning algorithms can "learn" the acoustical geometry of the environment and accurately track signal origin. In this paper, we propose the first machine-learning-based approach to identifying the source locations of semi-stationary, tonal, dolphin-whistle-like sounds in a highly reverberant space, specifically a half-cylindrical dolphin pool. Our algorithm works by supplying a learning network with an overabundance of location "clues", which are then selected under supervised training for their ability to discriminate source location in this particular environment. More specifically, we deliver estimated time-difference-of-arrivals (TDOA's) and normalized cross-correlation values computed from pairs of hydrophone signals to a random forest model for high-feature-volume classification and feature selection, and subsequently deliver the selected features into linear discriminant analysis, linear and quadratic Support Vector Machine (SVM), and Gaussian process models. Based on data from 14 sound source locations and 16 hydrophones, our classification models yielded perfect accuracy at predicting novel sound source locations. Our regression models yielded better accuracy than the established Steered-Response Power (SRP) method when all training data were used, and comparable accuracy along the pool surface when deprived of training data at testing sites; our methods additionally boast improved computation time and the potential for superior localization accuracy in all dimensions with more training data. Because of the generality of our method we argue it may be useful in a much wider variety of contexts.

Associated Data

figshare | 10.6084/m9.figshare.7956212

References

  1. Proc Biol Sci. 2001 Feb 7;268(1464):263-7 [PMID: 11217896]
  2. J Acoust Soc Am. 2012 Feb;131(2):1613-21 [PMID: 22352531]
  3. J Acoust Soc Am. 1998 Dec;104(6):3616-25 [PMID: 9857519]
  4. J Acoust Soc Am. 2000 Jun;107(6):3509-17 [PMID: 10875395]
  5. J Acoust Soc Am. 1993 Apr;93(4 Pt 1):2197-205 [PMID: 8473632]
  6. J Acoust Soc Am. 2013 Sep;134(3):2589-95 [PMID: 23968056]
  7. Rev Sci Instrum. 2016 Apr;87(4):041502 [PMID: 27131647]
  8. J Acoust Soc Am. 2002 Oct;112(4):1692-701 [PMID: 12398474]
  9. J Acoust Soc Am. 2008 Aug;124(2):1159-70 [PMID: 18681604]
  10. Proc Biol Sci. 2003 Mar 7;270(1514):497-502 [PMID: 12641904]
  11. J Acoust Soc Am. 1999 Aug;106(2):837-46 [PMID: 10462789]
  12. Proc Natl Acad Sci U S A. 2001 May 8;98(10):5937-42 [PMID: 11331768]
  13. R Soc Open Sci. 2016 Jan 20;3(1):150372 [PMID: 26909165]
  14. J Acoust Soc Am. 2005 Dec;118(6):3575-84 [PMID: 16419803]
  15. J Acoust Soc Am. 1986 Aug;80(2):688-91 [PMID: 3745687]
  16. J Acoust Soc Am. 2007 Aug;122(2):1208-15 [PMID: 17672667]
  17. J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2013 Jun;199(6):479-89 [PMID: 23649908]
  18. J Acoust Soc Am. 2000 Sep;108(3 Pt 1):1353-6 [PMID: 11008838]
  19. J Acoust Soc Am. 1985 Nov;78(5):1892-5 [PMID: 4067084]
  20. Science. 1961 Dec 8;134(3493):1873-6 [PMID: 17831115]
  21. J Acoust Soc Am. 2013 Sep;134(3):2383-92 [PMID: 23968035]
  22. Sci Rep. 2014 Nov 27;4:7215 [PMID: 25427517]
  23. J Acoust Soc Am. 2011 Jul;130(1):102-12 [PMID: 21786881]
  24. Int J Comp Psychol. 2010;23(3):401-413 [PMID: 24493908]
  25. Anim Behav. 1999 Feb;57(2):409-419 [PMID: 10049481]
  26. Proc Natl Acad Sci U S A. 2005 Jun 21;102(25):8939-43 [PMID: 15947077]

MeSH Term

Acoustics
Animals
Dolphins
Machine Learning
Vocalization, Animal

Word Cloud

Created with Highcharts 10.0.0environmentsourcetrainingdataaccuracylocationssoundspoolmodelsoriginsignalsgeneralitylearninghighlyreverberantspecificallydolphinlocationselecteddeliverclassificationlinearsoundyieldedmethodTrackingpropagatingwavecomplexreflectivesurfacesfullnearlyintractableproblemengenderedmultipledomain-specificliteraturespositsensorgeometriesfixedmachinealgorithmscan"learn"acousticalgeometryaccuratelytracksignalpaperproposefirstmachine-learning-basedapproachidentifyingsemi-stationarytonaldolphin-whistle-likespacehalf-cylindricalalgorithmworkssupplyingnetworkoverabundance"clues"supervisedabilitydiscriminateparticularestimatedtime-difference-of-arrivalsTDOA'snormalizedcross-correlationvaluescomputedpairshydrophonerandomforestmodelhigh-feature-volumefeatureselectionsubsequentlyfeaturesdiscriminantanalysisquadraticSupportVectorMachineSVMGaussianprocessBased1416hydrophonesperfectpredictingnovelregressionbetterestablishedSteered-ResponsePowerSRPusedcomparablealongsurfacedeprivedtestingsitesmethodsadditionallyboastimprovedcomputationtimepotentialsuperiorlocalizationdimensionsarguemayusefulmuchwidervarietycontextsLearninglocalizeenvironment:Machine-learningtrackingwhistle-like

Similar Articles

Cited By