Testing generalized linear models with high-dimensional nuisance parameter.

Jinsong Chen, Quefeng Li, Hua Yun Chen
Author Information
  1. Jinsong Chen: College of Applied Health Sciences, University of Illinois at Chicago, 1919 W Taylor St, Chicago, Illinois 60612, U.S.A.
  2. Quefeng Li: Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, U.S.A.
  3. Hua Yun Chen: School of Public Health, University of Illinois at Chicago, 2121 W Taylor St, Chicago, Illinois 60612, U.S.A.

Abstract

Generalized linear models often have a high-dimensional nuisance parameters, as seen in applications such as testing gene-environment interactions or gene-gene interactions. In these scenarios, it is essential to test the significance of a high-dimensional sub-vector of the model's coefficients. Although some existing methods can tackle this problem, they often rely on the bootstrap to approximate the asymptotic distribution of the test statistic, and thus are computationally expensive. Here, we propose a computationally efficient test with a closed-form limiting distribution, which allows the parameter being tested to be either sparse or dense. We show that under certain regularity conditions, the type I error of the proposed method is asymptotically correct, and we establish its power under high-dimensional alternatives. Extensive simulations demonstrate the good performance of the proposed test and its robustness when certain sparsity assumptions are violated. We also apply the proposed method to Chinese famine sample data in order to show its performance when testing the significance of gene-environment interactions.

Keywords

References

  1. J Am Stat Assoc. 2017;112(517):64-76 [PMID: 28736464]
  2. J Am Stat Assoc. 2021;116(535):1472-1486 [PMID: 34538987]
  3. Proc Natl Acad Sci U S A. 2019 Jul 16;116(29):14516-14525 [PMID: 31262828]
  4. Ann Stat. 2019 Oct;47(5):2671-2703 [PMID: 31534282]
  5. NPJ Schizophr. 2018 Aug 21;4(1):16 [PMID: 30131491]
  6. J Am Stat Assoc. 2014;109(507):1285-1301 [PMID: 25386043]
  7. Biometrics. 2018 Sep;74(3):1104-1111 [PMID: 29228454]
  8. Biometrics. 2016 Mar;72(1):85-94 [PMID: 26288029]
  9. Bioinformatics. 2009 Mar 15;25(6):714-21 [PMID: 19176549]
  10. J Mach Learn Res. 2020;21: [PMID: 32802002]
  11. J Am Stat Assoc. 2021;116(534):984-998 [PMID: 34421157]
  12. J Stat Softw. 2010;33(1):1-22 [PMID: 20808728]
  13. Ann Stat. 2013 Jun;41(3):1111-1141 [PMID: 26257447]
  14. J R Stat Soc Series B Stat Methodol. 2017 Jan;79(1):247-265 [PMID: 28479862]

Grants

  1. R01 AG073259/NIA NIH HHS

Word Cloud

Created with Highcharts 10.0.0high-dimensionaltestinteractionsparameterproposedlinearmodelsoftennuisancetestinggene-environmentsignificancedistributioncomputationallyshowcertainmethodperformanceGeneralizedparametersseenapplicationsgene-genescenariosessentialsub-vectormodel'scoefficientsAlthoughexistingmethodscantackleproblemrelybootstrapapproximateasymptoticstatisticthusexpensiveproposeefficientclosed-formlimitingallowstestedeithersparsedenseregularityconditionstypeerrorasymptoticallycorrectestablishpoweralternativesExtensivesimulationsdemonstrategoodrobustnesssparsityassumptionsviolatedalsoapplyChinesefaminesampledataorderTestinggeneralizedDenseModelmisspecificationU-statistics

Similar Articles

Cited By

No available data.