Predictive Modeling Dataset
March 5, 2025

The dataset is a .csv file containing data for 500 records. There are 8 predictor variables (x1, x2, …, x8) and one response variable (y). Use whichever pre-processing, visualization and modeling techniques you like in R to describe which predictors have the largest influence on the response variable. Be as specific as possible when describing any relationships you find.
In addition to reporting on which variables show the strongest relationships with the response, explain how you would validate your model and assess its predictive power.
Our expectation is that you will provide a short write up, along with all of the R code and visualizations you used to arrive at your conclusions.
The data you’ll get:
