Graphical Model Inference with Erosely Measured Data.

Lili Zheng, Genevera I Allen
Author Information
  1. Lili Zheng: Department of Electrical and Computer Engineering, Rice University.
  2. Genevera I Allen: Department of Electrical and Computer Engineering, Rice University.

Abstract

In this paper, we investigate the Gaussian graphical model inference problem in a novel setting that we call measurements, referring to irregularly measured or observed data. For graphs, this results in different node pairs having vastly different sample sizes which frequently arises in data integration, genomics, neuroscience, and sensor networks. Existing works characterize the graph selection performance using the minimum pairwise sample size, which provides little insights for erosely measured data, and no existing inference method is applicable. We aim to fill in this gap by proposing the first inference method that characterizes the different uncertainty levels over the graph caused by the erose measurements, named GI-JOE (raph nference when oint bservations are rose). Specifically, we develop an edge-wise inference method and an affiliated FDR control procedure, where the variance of each edge depends on the sample sizes associated with corresponding neighbors. We prove statistical validity under erose measurements, thanks to careful localized edge-wise analysis and disentangling the dependencies across the graph. Finally, through simulation studies and a real neuroscience data example, we demonstrate the advantages of our inference methods for graph selection from erosely measured data.

Keywords

References

  1. Nature. 2007 Jan 11;445(7124):168-76 [PMID: 17151600]
  2. ACM BCB. 2020 Sep;2020: [PMID: 34278382]
  3. Biostatistics. 2008 Jul;9(3):432-41 [PMID: 18079126]
  4. Biometrika. 2015 Mar;102(1):47-64 [PMID: 27625437]
  5. BMC Bioinformatics. 2018 Jun 8;19(1):220 [PMID: 29884114]
  6. Proc Natl Acad Sci U S A. 2015 Jun 9;112(23):7285-90 [PMID: 26060301]
  7. Nat Methods. 2018 Jul;15(7):539-542 [PMID: 29941873]
  8. Science. 2019 Apr 19;364(6437):255 [PMID: 31000656]
  9. Ann Appl Stat. 2018 Jun;12(2):1068-1095 [PMID: 31772696]
  10. F1000Res. 2019 Oct 17;8:1769 [PMID: 32148761]
  11. J Mach Learn Res. 2012 Apr;13:1059-1062 [PMID: 26834510]
  12. J Am Stat Assoc. 2016;111(514):621-633 [PMID: 28042188]
  13. J Mach Learn Res. 2015 Dec;16:3813-3847 [PMID: 27570498]
  14. Genome Biol. 2016 Aug 17;17(1):173 [PMID: 27534536]

Grants

  1. R01 GM140468/NIGMS NIH HHS

Word Cloud

Created with Highcharts 10.0.0inferencedatagraphmeasurementsmeasureddifferentsampleselectionmethodsizesneuroscienceeroselyeroseedge-wiseFDRcontrolpaperinvestigateGaussiangraphicalmodelproblemnovelsettingcallreferringirregularlyobservedgraphsresultsnodepairsvastlyfrequentlyarisesintegrationgenomicssensornetworksExistingworkscharacterizeperformanceusingminimumpairwisesizeprovideslittleinsightsexistingapplicableaimfillgapproposingfirstcharacterizesuncertaintylevelscausednamedGI-JOEraphnferenceointbservationsroseSpecificallydevelopaffiliatedprocedurevarianceedgedependsassociatedcorrespondingneighborsprovestatisticalvaliditythankscarefullocalizedanalysisdisentanglingdependenciesacrossFinallysimulationstudiesrealexampledemonstrateadvantagesmethodsGraphicalModelInferenceEroselyMeasuredDataUnevenstructuremissing

Similar Articles

Cited By