Functional data analysis for longitudinal data with informative observation times.

Caleb Weaver, Luo Xiao, Wenbin Lu
Author Information
  1. Caleb Weaver: Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA. ORCID
  2. Luo Xiao: Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA. ORCID
  3. Wenbin Lu: Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA. ORCID

Abstract

In functional data analysis for longitudinal data, the observation process is typically assumed to be noninformative, which is often violated in real applications. Thus, methods that fail to account for the dependence between observation times and longitudinal outcomes may result in biased estimation. For longitudinal data with informative observation times, we find that under a general class of shared random effect models, a commonly used functional data method may lead to inconsistent model estimation while another functional data method results in consistent and even rate-optimal estimation. Indeed, we show that the mean function can be estimated appropriately via penalized splines and that the covariance function can be estimated appropriately via penalized tensor-product splines, both with specific choices of parameters. For the proposed method, theoretical results are provided, and simulation studies and a real data analysis are conducted to demonstrate its performance.

Keywords

References

  1. Cai, N., Lu, W. and Zhang, H.H. (2012) Time-varying latent effect model for longitudinal data with informative observation times. Biometrics, 68, 1093-1102.
  2. Cai, T.T. and Yuan, M. (2011) Optimal estimation of the mean function based on discretely sampled functional data: phase transition. Annals of Statistics, 39, 2330-2355.
  3. Cella, D., Riley, W., Stone, A., Rothrock, N., Reeve, B., Yount, S., et al. (2010) The patient-reported outcomes measurement information system (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005-2008. Journal of Clinical Epidemiology, 63, 1179-1194.
  4. Claeskens, G., Krivobokova, T. and Opsomer, J.D. (2009) Asymptotic properties of penalized spline estimators. Biometrika, 96, 529-544.
  5. Eilers, P.H. and Marx, B.D. (1996) Flexible smoothing with B-splines and penalties. Statistical Science, 11, 89-102.
  6. Eilers, P.H. and Marx, B.D. (2003) Multivariate calibration with temperature interaction using two-dimensional penalized signal regression. Chemometrics and Intelligent Laboratory Systems, 66, 159-174.
  7. Fang, S., Zhang, H. and Sun, L. (2016) Joint analysis of longitudinal data with additive mixed effect model for informative observation times. Journal of Statistical Planning and Inference, 169, 43-55.
  8. Group, P.S. (2004) Levodopa and the progression of Parkinson's disease. New England Journal of Medicine, 351, 2498-2508.
  9. Guo, W. (2002) Functional mixed effects models. Biometrics, 58, 121-128.
  10. Holden, S.K., Finseth, T., Sillau, S.H. and Berman, B.D. (2018) Progression of MDS-UPDRS scores over five years in De Novo Parkinson disease from the Parkinson's progression markers initiative cohort. Movement Disorders Clinical Practice, 5, 47-53.
  11. Huang, J.Z. and Su, Y. (2021) Asymptotic properties of penalized spline estimators in concave extended linear models: rates of convergence. Annals of Statistics, 49, 3383-3407.
  12. Li, Y. and Hsing, T. (2010) Uniform convergence rates for nonparametric regression and principal component analysis in functional/longitudinal data. Annals of Statistics, 38, 3321-3351.
  13. Liang, Y., Lu, W. and Ying, Z. (2009) Joint modeling and analysis of longitudinal data with informative observation times. Biometrics, 65, 377-384.
  14. Lin, H., Scharfstein, D.O. and Rosenheck, R.A. (2004) Analysis of longitudinal data with irregular, outcome-dependent follow-up. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 66, 791-813.
  15. Mischley, L.K., Lau, R.C. and Weiss, N.S. (2017) Use of a self-rating scale of the nature and severity of symptoms in Parkinson's Disease (PRO-PD): correlation with quality of life and existing scales of disease severity. npj Parkinson's Disease, 3, 1-7.
  16. Ramsay, J.O. and Silverman, B.W. (2006) Functional Data Analysis. New York: Springer.
  17. Song, X., Mu, X. and Sun, L. (2012) Regression analysis of longitudinal data with time-dependent covariates and informative observation times. Scandinavian Journal of Statistics, 39, 248-258.
  18. Stone, C.J. (1982) Optimal global rates of convergence for nonparametric regression. Annals of Statistics, 10, 1040-1053.
  19. Sun, J., Park, D.-H., Sun, L. and Zhao, X. (2005) Semiparametric regression analysis of longitudinal data with informative observation times. Journal of the American Statistical Association, 100, 882-889.
  20. Sun, J., Sun, L. and Liu, D. (2007) Regression analysis of longitudinal data in the presence of informative observation and censoring times. Journal of the American Statistical Association, 102, 1397-1406.
  21. Sun, L., Song, X., Zhou, J. and Liu, L. (2012) Joint analysis of longitudinal data with informative observation times and a dependent terminal event. Journal of the American Statistical Association, 107, 688-700.
  22. Venuto, C.S., Potter, N.B., Ray Dorsey, E. and Kieburtz, K. (2016) A review of disease progression models of Parkinson's disease and applications in clinical trials. Movement Disorders, 31, 947-956.
  23. Wang, M.-C., Qin, J. and Chiang, C.-T. (2001) Analyzing recurrent event data with informative censoring. Journal of the American Statistical Association, 96, 1057-1065.
  24. Xiao, L. (2019) Asymptotic theory of penalized splines. Electronic Journal of Statistics, 13, 747-794.
  25. Xiao, L. (2020) Asymptotic properties of penalized splines for functional data. Bernoulli, 26, 2847-2875.
  26. Xiao, L., Li, C., Checkley, W. and Crainiceanu, C. (2018) Fast covariance estimation for sparse functional data. Statistics and Computing, 28, 511-522.
  27. Yao, F., Müller, H.-G., Clifford, A.J., Dueker, S.R., Follett, J., Lin, Y., et al. (2003) Shrinkage estimation for functional principal component scores with application to the population kinetics of plasma folate. Biometrics, 59, 676-685.
  28. Yao, F., Müller, H.-G. and Wang, J.-L. (2005) Functional data analysis for sparse longitudinal data. Journal of the American Statistical Association, 100, 577-590.
  29. Zhang, X. and Wang, J.-L. (2016) From sparse to dense functional data and beyond. Annals of Statistics, 44, 2281-2321.
  30. Zhao, X., Tong, X. and Sun, L. (2012) Joint analysis of longitudinal data with dependent observation times. Statistica Sinica, 22, 317-336.
  31. Zhu, L., Sun, J., Tong, X., and Pounds, S. (2011) Regression analysis of longitudinal data with informative observation times and application to medical cost data. Statistics in Medicine, 30, 1429-1440.

Grants

  1. R01 AG064803/NIH HHS
  2. R01 NS112303/NIH HHS
  3. R56 AG064803/NIH HHS

MeSH Term

Models, Statistical
Longitudinal Studies
Computer Simulation

Word Cloud

Created with Highcharts 10.0.0datalongitudinalobservationfunctionalanalysistimesestimationinformativemethodpenalizedsplinesrealmayresultsfunctioncanestimatedappropriatelyviaprocesstypicallyassumednoninformativeoftenviolatedapplicationsThusmethodsfailaccountdependenceoutcomesresultbiasedfindgeneralclasssharedrandomeffectmodelscommonlyusedleadinconsistentmodelanotherconsistentevenrate-optimalIndeedshowmeancovariancetensor-productspecificchoicesparametersproposedtheoreticalprovidedsimulationstudiesconducteddemonstrateits performanceFunctional

Similar Articles

Cited By (1)