HR Analysis Using R
Project Overview:
This project explores the factors influencing employee attrition using the dataset HR-Employee-Attrition.csv. The goal was to identify patterns and statistical relationships between demographic, work-related, and income variables, and to assess their impact on attrition.
Objectives:
Data Import & Exploration
The dataset was loaded into R using:
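The original call is not reproduced here. A minimal sketch, assuming the CSV sits in the working directory and is read with base R's `read.csv` (the helper name `load_hr` is hypothetical), might look like this:

```r
# Hypothetical helper: read the attrition CSV into a data frame.
# Assumption: the file is in the working directory; pass another path if not.
load_hr <- function(path = "HR-Employee-Attrition.csv") {
  hr <- read.csv(path, stringsAsFactors = FALSE)
  # Treat the outcome as a factor so later plots and tests group by it.
  hr$Attrition <- factor(hr$Attrition)
  hr
}

# Typical first look (uncomment once the file is available):
# hr <- load_hr()
# str(hr)      # column names and types
# summary(hr)  # basic distributions per column
```

Converting `Attrition` to a factor up front is a convenience choice; the formula-based functions used later would also accept a character column.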
To gain an initial understanding of relationships between continuous variables, a correlation matrix was computed:
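The exact call is not shown in this write-up. One way to compute such a matrix, sketched under the assumption that every numeric column is of interest (the helper name `numeric_cor` is hypothetical), is:

```r
# Hypothetical helper: Pearson correlation matrix over the numeric columns.
numeric_cor <- function(df) {
  # Keep numeric columns only; cor() rejects character and factor columns.
  num <- df[vapply(df, is.numeric, logical(1))]
  # Drop constant columns, whose correlation is undefined (zero variance).
  varies <- vapply(num, function(x) isTRUE(sd(x, na.rm = TRUE) > 0), logical(1))
  round(cor(num[varies], use = "complete.obs"), 2)
}

# Usage, assuming `hr` holds the loaded dataset:
# numeric_cor(hr)
```

Rounding to two decimals is only for readability; drop the `round()` call if the raw coefficients are needed downstream.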
Key Insight:
Data Visualization:
Scatterplot Matrix
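The columns that were plotted are not listed here. As a sketch, assuming the three continuous variables named elsewhere in this write-up (stored as `Age`, `MonthlyIncome`, and `TotalWorkingYears`, which is an assumption about the column names), a scatterplot matrix can be drawn with base R's `pairs`:

```r
# Hypothetical helper: scatterplot matrix for a chosen set of columns.
# Assumption: the default column names match those in the CSV.
plot_scatter_matrix <- function(hr,
                                cols = c("Age", "MonthlyIncome", "TotalWorkingYears")) {
  pairs(hr[cols], main = "Scatterplot matrix", pch = 19, cex = 0.4)
  invisible(cols)  # return the plotted column names without printing them
}

# Usage, assuming `hr` holds the loaded dataset:
# plot_scatter_matrix(hr)
```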
This visualization revealed:
Boxplot: Age vs. Attrition
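A plot of this kind can be produced with base R's `boxplot` and a formula. The sketch below assumes the columns are named `Age` and `Attrition`; the helper name and axis labels are illustrative:

```r
# Hypothetical helper: distribution of age within each attrition group.
plot_age_by_attrition <- function(hr) {
  boxplot(Age ~ Attrition, data = hr,
          main = "Age by attrition status",
          xlab = "Attrition", ylab = "Age (years)")
}

# Usage, assuming `hr` holds the loaded dataset:
# plot_age_by_attrition(hr)
```

`boxplot` invisibly returns the statistics it draws (medians, hinges, group sizes), which is handy for quoting exact values alongside the plot.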
Observation:
Statistical Testing
T-Test: Age and Attrition
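The test call itself is not shown in this write-up. A minimal sketch using `t.test` with a formula, assuming `Attrition` has exactly two levels (the helper name `age_ttest` is hypothetical):

```r
# Hypothetical helper: compare mean age between the two attrition groups.
# t.test() defaults to Welch's test, which does not assume equal variances.
age_ttest <- function(hr) {
  t.test(Age ~ Attrition, data = hr)
}

# Usage, assuming `hr` holds the loaded dataset:
# res <- age_ttest(hr)
# res$estimate  # group means
# res$p.value   # two-sided p-value
# res$conf.int  # confidence interval for the difference in means
```

Whether the original analysis used Welch's test or the pooled-variance version (`var.equal = TRUE`) is not stated, so the default is used here.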
Result:
T-Test: Employee Number and Attrition
EmployeeNumber is an ID field and not inherently meaningful, so this test served only as a sanity check for systematic bias in ID assignment related to attrition. No significant difference was found.
Regression Modeling
Model 1: Monthly Income ~ Age
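A simple linear regression of this form can be fit with `lm`. The sketch assumes the income column is stored as `MonthlyIncome`; the helper name is hypothetical:

```r
# Hypothetical helper: regress monthly income on age.
fit_income_age <- function(hr) {
  lm(MonthlyIncome ~ Age, data = hr)
}

# Usage, assuming `hr` holds the loaded dataset:
# m1 <- fit_income_age(hr)
# summary(m1)  # slope, standard error, R-squared
# coef(m1)     # intercept and change in income per additional year of age
```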
Model 2: Monthly Income ~ Age + Total Working Years
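Adding a second predictor makes it possible to check whether it explains income beyond age alone. The sketch below fits both models and compares them with an F-test via `anova`; the column names `MonthlyIncome` and `TotalWorkingYears` and the helper name are assumptions:

```r
# Hypothetical helper: fit the one- and two-predictor models and compare them.
fit_income_models <- function(hr) {
  m1 <- lm(MonthlyIncome ~ Age, data = hr)
  m2 <- lm(MonthlyIncome ~ Age + TotalWorkingYears, data = hr)
  list(
    age_only   = m1,
    with_years = m2,
    # Nested-model F-test: does TotalWorkingYears improve the fit?
    comparison = anova(m1, m2)
  )
}

# Usage, assuming `hr` holds the loaded dataset:
# fits <- fit_income_models(hr)
# summary(fits$with_years)
# fits$comparison
```

Age and total working years tend to move together, so the Age coefficient may shrink or change sign once both are in the model; the comparison table helps judge whether the extra term is worth keeping.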
Key Insights & Conclusions
Thank you for taking the time to review my work. If you have any questions, please don’t hesitate to reach out. I would also greatly appreciate it if you could share any relevant job opportunities, or connect me with others who may be looking for a data analytics professional.