Dynamical predictive coding with reservoir computing performs noise-robust multi-sensory speech recognition.

Yoshihiro Yonemura, Yuichi Katori
Author Information
  1. Yoshihiro Yonemura: Graduate of System Information Science, Future University Hakodate, Hakodate, Hokkaido, Japan.
  2. Yuichi Katori: Graduate of System Information Science, Future University Hakodate, Hakodate, Hokkaido, Japan.

Abstract

Multi-sensory integration is a perceptual process through which the brain synthesizes a unified perception by integrating inputs from multiple sensory modalities. A key issue is understanding how the brain performs multi-sensory integrations using a common neural basis in the cortex. A cortical model based on reservoir computing has been proposed to elucidate the role of recurrent connectivity among cortical neurons in this process. Reservoir computing is well-suited for time series processing, such as speech recognition. This inquiry focuses on extending a reservoir computing-based cortical model to encompass multi-sensory integration within the cortex. This research introduces a dynamical model of multi-sensory speech recognition, leveraging predictive coding combined with reservoir computing. Predictive coding offers a framework for the hierarchical structure of the cortex. The model integrates reliability weighting, derived from the computational theory of multi-sensory integration, to adapt to multi-sensory time series processing. The model addresses a multi-sensory speech recognition task, necessitating the management of complex time series. We observed that the reservoir effectively recognizes speech by extracting time-contextual information and weighting sensory inputs according to sensory noise. These findings indicate that the dynamic properties of recurrent networks are applicable to multi-sensory time series processing, positioning reservoir computing as a suitable model for multi-sensory integration.

Keywords

References

  1. Neurosci Res. 2003 Nov;47(3):277-87 [PMID: 14568109]
  2. Front Hum Neurosci. 2019 Sep 26;13:335 [PMID: 31611780]
  3. Trends Cogn Sci. 2006 Jun;10(6):278-85 [PMID: 16713325]
  4. PLoS Comput Biol. 2016 Jun 10;12(6):e1004967 [PMID: 27286251]
  5. Nature. 1976 Dec 23-30;264(5588):746-8 [PMID: 1012311]
  6. Neuron. 2009 Aug 27;63(4):544-57 [PMID: 19709635]
  7. Trends Neurosci. 2004 Dec;27(12):712-9 [PMID: 15541511]
  8. Front Psychol. 2016 Nov 18;7:1792 [PMID: 27917138]
  9. Curr Biol. 2004 Feb 3;14(3):257-62 [PMID: 14761661]
  10. Cogn Process. 2007 Sep;8(3):159-66 [PMID: 17429704]
  11. Nature. 2002 Jan 24;415(6870):429-33 [PMID: 11807554]
  12. Nat Neurosci. 1999 Jan;2(1):79-87 [PMID: 10195184]
  13. J Neurosci. 2011 Feb 2;31(5):1704-14 [PMID: 21289179]
  14. Front Syst Neurosci. 2010 Jun 23;4:25 [PMID: 20631844]
  15. Front Integr Neurosci. 2015 Mar 26;9:19 [PMID: 25859192]
  16. J Physiol Paris. 2004 Jan-Jun;98(1-3):191-205 [PMID: 15477032]
  17. PLoS Biol. 2015 Feb 24;13(2):e1002073 [PMID: 25710328]
  18. Sci Rep. 2018 Mar 1;8(1):3843 [PMID: 29497060]
  19. Nature. 1998 Feb 19;391(6669):756 [PMID: 9486643]
  20. Nat Rev Neurosci. 2008 Apr;9(4):255-66 [PMID: 18354398]
  21. PLoS One. 2009;4(3):e4638 [PMID: 19259259]
  22. Neuroimage. 2009 Feb 1;44(3):1210-23 [PMID: 18973818]

Word Cloud

Created with Highcharts 10.0.0multi-sensoryreservoirmodelcomputingspeechintegrationrecognitiontimeseriescodingsensorycortexcorticalprocessingpredictiveprocessbraininputsperformsrecurrentweightingMulti-sensoryperceptualsynthesizesunifiedperceptionintegratingmultiplemodalitieskeyissueunderstandingintegrationsusingcommonneuralbasisbasedproposedelucidateroleconnectivityamongneuronsReservoirwell-suitedinquiryfocusesextendingcomputing-basedencompasswithinresearchintroducesdynamicalleveragingcombinedPredictiveoffersframeworkhierarchicalstructureintegratesreliabilityderivedcomputationaltheoryadaptaddressestasknecessitatingmanagementcomplexobservedeffectivelyrecognizesextractingtime-contextualinformationaccordingnoisefindingsindicatedynamicpropertiesnetworksapplicablepositioningsuitableDynamicalnoise-robustnonlineardynamics

Similar Articles

Cited By

No available data.