Ridge regression in the logit model: an application to birth weight data
This post presents a short application of the ridge regression method to the modeling of birth weights of infants. It is based on joint work with V. Kazakova . The logistic model employs an infamous data set by Lee & Scott (1986), known for its collinearity issues (see Seber & Wild, 1989, p.104ff.) Background Ridge regression (Hoerl, 1962) is an estimation method for models with strong covariate correlation. An accessible introduction to ridge estimation in linear models was written by M. Taboga on StatLect . A broad-view introduction is Hastie (2010). Ridge works by adding a penalty term to the sum-of-squared-residuals (SSR) minimization problem in linear regression. For generalized linear models (GLM) we may instead penalize the log-likelihood function . Assume the target vector $y$ to be distributed with density $f(Y, \beta)$, where $\beta$ is the vector of coefficients in the linear predictor part of the GLM. The maximum likelihood estimate of $\beta$ is found by solving...