Download PDF

Biometrika

Publication date: 2018-09-01
Volume: 105 Pages: 645 - 664
Publisher: Oxford University Press (OUP)

Author:

Charkhi, Ali
Claeskens, Gerda

Keywords:

Akaike information criterion, Confidence region, Likelihood model, Model selection, Post-selection inference, Science & Technology, Life Sciences & Biomedicine, Physical Sciences, Biology, Mathematical & Computational Biology, Statistics & Probability, Life Sciences & Biomedicine - Other Topics, Mathematics, MODEL-SELECTION, CONFIDENCE-INTERVALS, PREDICTION REGIONS, VARIABLE SELECTION, ESTIMATORS, UNIFORM, REGRESSION, 0103 Numerical and Computational Mathematics, 0104 Statistics, 1403 Econometrics, 3802 Econometrics, 4905 Statistics

Abstract:

Ignoring the model selection step in inference after selection is harmful. This paper studies the asymptotic distribution of estimators after model selection using the Akaike information criterion. First, we consider the classical setting in which a true model exists and is included in the candidate set of models. We exploit the overselection property of this criterion in the construction of a selection region, and obtain the asymptotic distribution of estimators and linear combinations thereof conditional on the selected model. The limiting distribution depends on the set of competitive models and on the smallest overparameterized model. Second, we relax the assumption about the existence of a true model, and obtain uniform asymptotic results. We use simulation to study the resulting postselection distributions and to calculate confidence regions for the model parameters. We apply themethod to data.