Data Pipeline & Model Training
Cleaned raw dataset (missing values, outliers, normalization)
Applied one-hot encoding and feature scaling for categorical variables
Split data into train-test subsets with cross-validation
Trained a linear regression model with R² of 0.85