Recovering sound sources from embedded repetition.

Josh H McDermott, David Wrobleski, Andrew J Oxenham
Author Information
  1. Josh H McDermott: Center for Neural Science, New York University, New York, NY 10003, USA. jhm@cns.nyu.edu

Abstract

Cocktail parties and other natural auditory environments present organisms with mixtures of sounds. Segregating individual sound sources is thought to require prior knowledge of source properties, yet these presumably cannot be learned unless the sources are segregated first. Here we show that the auditory system can bootstrap its way around this problem by identifying sound sources as repeating patterns embedded in the acoustic input. Due to the presence of competing sounds, source repetition is not explicit in the input to the ear, but it produces temporal regularities that listeners detect and use for segregation. We used a simple generative model to synthesize novel sounds with naturalistic properties. We found that such sounds could be segregated and identified if they occurred more than once across different mixtures, even when the same sounds were impossible to segregate in single mixtures. Sensitivity to the repetition of sound sources can permit their recovery in the absence of other segregation cues or prior knowledge of sounds, and could help solve the cocktail party problem.

References

  1. Nature. 1999 Jan 14;397(6715):154-7 [PMID: 9923676]
  2. J Acoust Soc Am. 2003 Nov;114(5):2835-45 [PMID: 14650018]
  3. J Acoust Soc Am. 2003 Dec;114(6 Pt 1):3394-411 [PMID: 14714819]
  4. J Acoust Soc Am. 1991 Jan;89(1):329-40 [PMID: 2002172]
  5. Curr Biol. 2006 Feb 7;16(3):264-71 [PMID: 16461279]
  6. Exp Psychol. 2004;51(4):240-8 [PMID: 15620225]
  7. J Acoust Soc Am. 1992 Jun;91(6):3381-90 [PMID: 1619115]
  8. Nat Neurosci. 2007 Dec;10(12):1601-7 [PMID: 17994016]
  9. Proc Natl Acad Sci U S A. 2008 Apr 15;105(15):5939-44 [PMID: 18391210]
  10. J Neurosci. 2003 Dec 17;23(37):11489-504 [PMID: 14684853]
  11. Neuron. 2010 May 27;66(4):610-8 [PMID: 20510864]
  12. Psychol Bull. 2007 Sep;133(5):780-99 [PMID: 17723030]
  13. Can J Psychol. 1978 Mar;32(1):19-31 [PMID: 728845]
  14. Nat Neurosci. 2005 Oct;8(10):1371-9 [PMID: 16136039]
  15. J Acoust Soc Am. 1998 Oct;104(4):2326-38 [PMID: 10491697]
  16. Nature. 2006 Feb 23;439(7079):978-82 [PMID: 16495999]
  17. J Acoust Soc Am. 2008 Dec;124(6):3751-71 [PMID: 19206802]
  18. J Acoust Soc Am. 1999 Nov;106(5):2719-32 [PMID: 10573888]
  19. J Comp Psychol. 2008 Aug;122(3):235-51 [PMID: 18729652]
  20. Science. 1996 Dec 13;274(5294):1926-8 [PMID: 8943209]
  21. J Acoust Soc Am. 2005 Dec;118(6):3766-73 [PMID: 16419821]
  22. J Acoust Soc Am. 1994 Jun;95(6):3475-80 [PMID: 8046139]
  23. Curr Biol. 2009 Dec 1;19(22):R1024-7 [PMID: 19948136]
  24. J Acoust Soc Am. 1993 Jun;93(6):3454-67 [PMID: 8326071]
  25. Science. 1970 Jan 23;167(3917):392-3 [PMID: 5409744]

Grants

  1. R01 DC007657/NIDCD NIH HHS

MeSH Term

Acoustic Stimulation
Adult
Auditory Pathways
Auditory Perception
Cues
Female
Humans
Male
Noise
Pattern Recognition, Physiological
Periodicity
Psychoacoustics
Sound Spectrography

Word Cloud

Created with Highcharts 10.0.0soundssourcessoundmixturesrepetitionauditorypriorknowledgesourcepropertiessegregatedcanproblemembeddedinputsegregationCocktailpartiesnaturalenvironmentspresentorganismsSegregatingindividualthoughtrequireyetpresumablylearnedunlessfirstshowsystembootstrapwayaroundidentifyingrepeatingpatternsacousticDuepresencecompetingexplicitearproducestemporalregularitieslistenersdetectuseusedsimplegenerativemodelsynthesizenovelnaturalisticfoundidentifiedoccurredacrossdifferentevenimpossiblesegregatesingleSensitivitypermitrecoveryabsencecueshelpsolvecocktailpartyRecovering

Similar Articles

Cited By (47)