Multilayer Exponential Family Factor models for integrative analysis and learning disease progression.

Qinxia Wang, Yuanjia Wang
Author Information
  1. Qinxia Wang: Department of Biostatistics, Mailman School of Public Health, Columbia University, 722 W168th Street, New York, 10032, USA.
  2. Yuanjia Wang: Department of Biostatistics, Mailman School of Public Health, Columbia University, 722 W168th Street, New York, 10032, USA.

Abstract

Current diagnosis of neurological disorders often relies on late-stage clinical symptoms, which poses barriers to developing effective interventions at the premanifest stage. Recent research suggests that biomarkers and subtle changes in clinical markers may occur in a time-ordered fashion and can be used as indicators of early disease. In this article, we tackle the challenges to leverage multidomain markers to learn early disease progression of neurological disorders. We propose to integrate heterogeneous types of measures from multiple domains (e.g., discrete clinical symptoms, ordinal cognitive markers, continuous neuroimaging, and blood biomarkers) using a hierarchical Multilayer Exponential Family Factor (MEFF) model, where the observations follow exponential family distributions with lower-dimensional latent factors. The latent factors are decomposed into shared factors across multiple domains and domain-specific factors, where the shared factors provide robust information to perform extensive phenotyping and partition patients into clinically meaningful and biologically homogeneous subgroups. Domain-specific factors capture remaining unique variations for each domain. The MEFF model also captures nonlinear trajectory of disease progression and orders critical events of neurodegeneration measured by each marker. To overcome computational challenges, we fit our model by approximate inference techniques for large-scale data. We apply the developed method to Parkinson's Progression Markers Initiative data to integrate biological, clinical, and cognitive markers arising from heterogeneous distributions. The model learns lower-dimensional representations of Parkinson's disease (PD) and the temporal ordering of the neurodegeneration of PD.

Keywords

References

  1. Biometrics. 2018 Dec;74(4):1301-1310 [PMID: 29738627]
  2. J Neurosci. 2009 Dec 2;29(48):15223-31 [PMID: 19955375]
  3. Neurobiol Aging. 2005 Oct;26(9):1245-60; discussion 1279-82 [PMID: 16046030]
  4. Mov Disord. 2012 Aug;27(9):1125-8 [PMID: 22692724]
  5. Stat Med. 2018 Dec 30;37(30):4721-4742 [PMID: 30256435]
  6. Neurodegener Dis. 2018;18(4):173-190 [PMID: 30089306]
  7. Gend Med. 2007 Mar;4(1):8-18 [PMID: 17584622]
  8. Ann Appl Stat. 2013 Mar 1;7(1):523-542 [PMID: 23745156]
  9. Neuron. 2014 Nov 5;84(3):594-607 [PMID: 25442938]
  10. Drug Discov Today. 2013 Feb;18(3-4):155-62 [PMID: 22982303]
  11. Lancet Neurol. 2010 Jan;9(1):119-28 [PMID: 20083042]
  12. Mov Disord. 2007 Dec;22(16):2386-93 [PMID: 17894337]
  13. Biometrics. 2019 Dec;75(4):1121-1132 [PMID: 31254385]
  14. Prog Neurobiol. 2011 Dec;95(4):629-35 [PMID: 21930184]
  15. Alzheimers Dement. 2014 Oct;10(5 Suppl):S400-10 [PMID: 24656849]
  16. Biostatistics. 2020 Apr 1;21(2):302-318 [PMID: 30247540]

Grants

  1. NS073671/NIH HHS
  2. /NIH HHS
  3. R01 MH123487/NIMH NIH HHS
  4. R01 NS073671/NINDS NIH HHS
  5. R01 GM124104/NIGMS NIH HHS

MeSH Term

Humans
Disease Progression
Parkinson Disease
Biomarkers
Neuroimaging

Chemicals

Biomarkers

Word Cloud

Created with Highcharts 10.0.0diseasefactorsclinicalmarkersmodeldisordersprogressionFactormodelsneurologicalsymptomsbiomarkersearlychallengesintegrateheterogeneousmultipledomainscognitiveMultilayerExponentialFamilyMEFFdistributionslower-dimensionallatentsharedneurodegenerationinferencedataParkinson'sPDCurrentdiagnosisoftenrelieslate-stageposesbarriersdevelopingeffectiveinterventionspremanifeststageRecentresearchsuggestssubtlechangesmayoccurtime-orderedfashioncanusedindicatorsarticletackleleveragemultidomainlearnproposetypesmeasuresegdiscreteordinalcontinuousneuroimagingbloodusinghierarchicalobservationsfollowexponentialfamilydecomposedacrossdomain-specificproviderobustinformationperformextensivephenotypingpartitionpatientsclinicallymeaningfulbiologicallyhomogeneoussubgroupsDomain-specificcaptureremaininguniquevariationsdomainalsocapturesnonlineartrajectoryorderscriticaleventsmeasuredmarkerovercomecomputationalfitapproximatetechniqueslarge-scaleapplydevelopedmethodProgressionMarkersInitiativebiologicalarisinglearnsrepresentationstemporalorderingintegrativeanalysislearningDataintegrationInflectionpointsNeurologicalParkinson’sProbabilisticVariational

Similar Articles

Cited By