Doubly robust calibration of prediction sets under covariate shift.

Yachong Yang, Arun Kumar Kuchibhotla, Eric Tchetgen Tchetgen
Author Information
  1. Yachong Yang: Department of Statistics & Data Science, University of Pennsylvania, Philadelphia, PA, USA.
  2. Arun Kumar Kuchibhotla: Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA, USA. ORCID
  3. Eric Tchetgen Tchetgen: Department of Statistics & Data Science, University of Pennsylvania, Philadelphia, PA, USA.

Abstract

Conformal prediction has received tremendous attention in recent years and has offered new solutions to problems in missing data and causal inference; yet these advances have not leveraged modern semi-parametric efficiency theory for more efficient uncertainty quantification. We consider the problem of obtaining well-calibrated prediction regions that can data adaptively account for a shift in the distribution of covariates between training and test data. Under a covariate shift assumption analogous to the standard missing at random assumption, we propose a general framework based on efficient influence functions to construct well-calibrated prediction regions for the unobserved outcome in the test sample without compromising coverage.

References

  1. Proc Natl Acad Sci U S A. 2023 Feb 7;120(6):e2214889120 [PMID: 36730196]
  2. Biometrics. 2005 Dec;61(4):962-73 [PMID: 16401269]
  3. Stat Sci. 2007;22(4):569-573 [PMID: 18516239]
  4. J R Stat Soc Series B Stat Methodol. 2023 Jul 17;85(5):1680-1705 [PMID: 38312527]
  5. J Am Stat Assoc. 2013;108(501):278-287 [PMID: 25237208]
  6. Ann Stat. 2017 Oct;45(5):1951-1987 [PMID: 30971851]

Word Cloud

Created with Highcharts 10.0.0predictiondatashiftmissingefficientwell-calibratedregionstestcovariateassumptionConformalreceivedtremendousattentionrecentyearsofferednewsolutionsproblemscausalinferenceyetadvancesleveragedmodernsemi-parametricefficiencytheoryuncertaintyquantificationconsiderproblemobtainingcanadaptivelyaccountdistributioncovariatestraininganalogousstandardrandomproposegeneralframeworkbasedinfluencefunctionsconstructunobservedoutcomesamplewithoutcompromisingcoverageDoublyrobustcalibrationsets

Similar Articles

Cited By