ISYE 6501 Introduction to Analytic
Model Homework 5 Georgia
Institute of Technology.
, lOMoARcPSD| 43283024
isye6501_homework5
2024-06-19
Question 11.1
1) Backward Elimination
The backward elimination process started with all predictor variables and iteratively removed the least
significant variables based on the Akaike Information Criterion (AIC). The final model after backward
elimination includes the following predictors:
M Ed Po1 M.F U1 U2 Ineq Prob
The AIC for this model was 503.93.
Forward Selection The forward selection method also started with all predictor variables and included vari-
ables iteratively to minimize the AIC. The final model selected by forward selection contains the same
variables as the backward elimination model.
Stepwise Selection The stepwise selection method combined both forward and backward selection procedures
to minimize the AIC. The final model from stepwise selection included:
M Ed Po1 M.F U1 U2 Ineq Prob
This model also had an AIC of 503.93, identical to the backward and forward models.
2) LASSO (Least Absolute Shrinkage and Selection Operator)
LASSO was used to perform variable selection and regularization to enhance the prediction accuracy and
interpretability of the statistical model. The standardized predictors (x) and the standardized response
(y) were used in the model. The glmnet package was used to fit the LASSO model, and cross-validation
(cv.glmnet) was performed to select the optimal lambda that minimized the mean cross-validated error.
After plotting the LASSO path and selecting the optimal lambda using cross-validation, the coefficients of
the final model can be extracted. The significant predictors typically shrink towards zero with LASSO,
leading to a simpler model with fewer predictors.
Based on the model selection procedures (backward elimination, forward selection, and stepwise selection),
the significant predictors for the crime rate are:
M Ed Po1 M.F U1 U2 Ineq Prob
3) ELASTIC NET
The output of the provided R script indicates that the Elastic Net model was tuned and evaluated for
different values of the mixing parameter alpha to identify the best performing model based on the mean
squared error. Train the data with cross-validation then evaluated it on test data. Then I identified the best
model by detecting the lowest MSE which was when Alpha = 0.8
, lOMoARcPSD| 43283024
rm(list=ls())
set.seed(123)
data <- read.table("uscrime.txt", stringsAsFactors = FALSE, header = TRUE)
backwards_model <- lm(Crime~., data=data)
backwards_model <- step(backwards_model, direction ="backward")
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller saraciousstuvia. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $15.99. You're not tied to anything after your purchase.