Demystifying Salaries: A Data Science Approach to Predicting Salary Ranges

jadavvineet73 46 views 14 slides May 21, 2024
Slide 1
Slide 1 of 14
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14

About This Presentation

This presentation dives into the world of data science and explores its application in predicting salary ranges. We'll uncover the secrets hidden within data sets, unveil the power of machine learning algorithms, and shed light on factors that influence salaries in today's job market.
Visit ...


Slide Content

The "Salary Prediction Dataset" is a synthetic dataset generated for the purpose of exploring salary prediction tasks. It contains simulated data reflecting various factors influencing salary levels such as education, experience, location, job title, age, and gender. This dataset can be utilized for predictive modeling tasks to estimate salaries based on these factors.

Data Information

Data Insights

EDA Title distribution Withing Manager, Director, Analyst, Engineer. Which shows Count of Manager and Director are high followed by Analyst and Engineer “Gender Relationship” that shows the distribution of people by gender. The pie chart is divided into two slices, colored blue and orange. The blue slice, labeled "Male", represents 51.60% of the population and the orange slice, labeled "Female", represents 48.40% of the population.

The salary distribution for all four job titles is positively skewed. This means that there are more people towards the lower end of the salary range. Managers have the lowest median salary among the four job titles. Directors have a higher median salary than managers. Analysts have a median salary that is lower than directors but higher than managers. Engineers have the highest median salary among the four job titles..

Education vs. Salary: There is a slight tendency for people with higher education levels to have higher salaries. Experience vs. Salary: there is a clearer tendency for people with more experience to have higher salaries. Age vs. Salary: there is a slight tendency for people with higher age to have higher salaries. Location vs. Salary: For some locations, there is a weak positive correlation, while for others there is no correlation.

Label Encoding

Dropping Unnecessary Columns

Splitting Data into Dependent and Independent Variable

Applying Linear Regression Algorithm Dividing data into Train and Test

Applying Random Forest Regressor Algorithm & Decision Tree Regressor

Applying Random Forest Regressor Algorithm

Since the score given by Random forest regressor is on higher side. We will go with Random forest regressor