Variable selection in multiple linear regression: The influence of individual cases

  • SJ Steel Department of Statistics and Actuarial Science, Stellenbosch University
  • DW Uys Department of Statistics and Actuarial Science, Stellenbosch University

Abstract

The influence of individual cases in a data set is studied when variable selection is applied in multiple linear regression. Two different influence measures, based on the C_p criterion and Akaike's information criterion, are introduced. The relative change in the selection criterion when an individual case is omitted is proposed as the selection influence of the specific omitted case. Four standard examples from the literature are considered and the selection influence of the cases is calculated. It is argued that the selection procedure may be improved by taking the selection influence of individual data cases into account.
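As a rough illustration of the idea described above, the following sketch computes a leave-one-out selection influence based on Mallows' C_p. It rests on several assumptions that the abstract does not confirm: selection is done by exhaustive search over all predictor subsets, the error variance is estimated from the full model, and the influence of case i is taken as (C_p - C_p(i)) / C_p, where C_p(i) is the minimum criterion value after omitting case i. The function names `rss`, `best_cp`, and `selection_influence` are hypothetical, and the authors' exact definition may differ.

```python
import itertools
import numpy as np

def rss(X, y):
    """Residual sum of squares of an OLS fit that includes an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    r = y - A @ beta
    return float(r @ r)

def best_cp(X, y):
    """Return (minimum C_p, selected subset) over all non-empty subsets."""
    n, m = X.shape
    # Assumption: error variance is estimated from the full model.
    sigma2 = rss(X, y) / (n - m - 1)
    best = (np.inf, ())
    for k in range(1, m + 1):
        for subset in itertools.combinations(range(m), k):
            p = k + 1  # number of parameters, intercept included
            cp = rss(X[:, list(subset)], y) / sigma2 - n + 2 * p
            best = min(best, (cp, subset))
    return best

def selection_influence(X, y):
    """Relative change in the minimum C_p when each case is omitted in turn."""
    cp_all, _ = best_cp(X, y)
    n = len(y)
    influence = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        cp_i, _ = best_cp(X[keep], y[keep])
        # Assumed form of the measure; unstable if cp_all is close to zero.
        influence[i] = (cp_all - cp_i) / cp_all
    return influence
```

Cases whose omission changes the criterion markedly would be the ones flagged as selection influential under this assumed definition. An analogous measure could be built by replacing C_p with Akaike's information criterion.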
Published: 2007-12-01
Section: Research Articles