On a Robust MaxEnt Process Regression Model with Sample-Selection.

Hea-Jung Kim, Mihyang Bae, Daehwa Jin
Author Information
  1. Hea-Jung Kim: Department of Statistics, Dongguk University-Seoul, Pil-Dong 3Ga, Chung-Gu, Seoul 100-715, Korea.
  2. Mihyang Bae: Department of Statistics, Dongguk University-Seoul, Pil-Dong 3Ga, Chung-Gu, Seoul 100-715, Korea.
  3. Daehwa Jin: Department of Statistics, Dongguk University-Seoul, Pil-Dong 3Ga, Chung-Gu, Seoul 100-715, Korea.

Abstract

In a regression analysis, a sample-selection bias arises when a dependent variable is partially observed as a result of the sample selection. This study introduces a Maximum Entropy (MaxEnt) process regression model that assumes a MaxEnt prior distribution for its nonparametric regression function and finds that the MaxEnt process regression model includes the well-known Gaussian process regression (GPR) model as a special case. Then, this special MaxEnt process regression model, i.e., the GPR model, is generalized to obtain a robust sample-selection Gaussian process regression (RSGPR) model that deals with non-normal data in the sample selection. Various properties of the RSGPR model are established, including the stochastic representation, distributional hierarchy, and magnitude of the sample-selection bias. These properties are used in the paper to develop a hierarchical Bayesian methodology to estimate the model. This involves a simple and computationally feasible Markov chain Monte Carlo algorithm that avoids analytical or numerical derivatives of the log-likelihood function of the model. The performance of the RSGPR model in terms of the sample-selection bias correction, robustness to non-normality, and prediction, is demonstrated through results in simulations that attest to its good finite-sample performance.

Keywords

References

  1. Front Hum Neurosci. 2017 Jan 11;10:647 [PMID: 28123359]

Word Cloud

Created with Highcharts 10.0.0modelregressionprocesssample-selectionMaxEntbiasGaussianRSGPRsampleselectionfunctionGPRspecialrobustpropertieshierarchicalBayesianmethodologyMarkovchainMonteCarloperformancecorrectionanalysisarisesdependentvariablepartiallyobservedresultstudyintroducesMaximumEntropyassumespriordistributionnonparametricfindsincludeswell-knowncaseiegeneralizedobtaindealsnon-normaldataVariousestablishedincludingstochasticrepresentationdistributionalhierarchymagnitudeusedpaperdevelopestimateinvolvessimplecomputationallyfeasiblealgorithmavoidsanalyticalnumericalderivativeslog-likelihoodtermsrobustnessnon-normalitypredictiondemonstratedresultssimulationsattestgoodfinite-sampleRobustProcessRegressionModelSample-Selection

Similar Articles

Cited By