Built a complete PCA + ML pipeline on a student performance dataset (395 rows, 33 features). After cleaning, standardizing numeric variables, and encoding categorical fields, I explored relationships with correlation and study-habit vs grade visualizations. Then I implemented PCA end-to-end (covariance matrix → eigenvalues/eigenvectors, scree plots, biplots, and transformation dashboards) to understand variance and reduce dimensionality. Finally, I trained an SVM classifier on the top 5 principal components to predict Pass vs Fail, comparing kernels—best result: Linear SVM, 94.94% test accuracy. #Python #PCA #MachineLearning #SVM #DataScience #scikitlearn #AICadmey

To view or add a comment, sign in

Explore content categories