In this project, I analyze hiring process data drawn from records of previous hires at a multinational company. The aim is to uncover trends in the company's hiring process that can inform future decisions and improvements.
Data wrangling is a core part of a data analyst's work. It involves handling missing data, simplifying the dataset, detecting and dealing with outliers, and summarizing the findings. The main goal is to provide meaningful conclusions that can help the company enhance its hiring process and decision-making.
The project repository is organized as follows:
├── LICENSE
├── README.md <- Project README.
├── reports <- Folder containing the final reports/results of this project.
│ │
│ └── Report.pdf <- Final analysis report in PDF.
│
├── src <- Source for this project.
│ │
│ └── data <- Datasets used and collected for this project.
Inspecting the dataset for missing values and determining the most suitable strategy for handling them. This might involve filling in missing values with appropriate alternatives or removing records with missing data, depending on the situation.
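The analysis in this project was done in Excel. Purely as an illustration, the two strategies described above (filling in versus removing) might look like the following Python sketch. The column names `department` and `salary` and the toy records are assumptions made for the example, not the actual dataset.

```python
from statistics import median

# Hypothetical toy records; None marks a missing value.
records = [
    {"department": "Operations", "salary": 42000},
    {"department": None, "salary": 58000},
    {"department": "Service", "salary": None},
    {"department": "Service", "salary": 61000},
]


def count_missing(rows):
    """Return how many values are missing in each column."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (1 if value is None else 0)
    return counts


def fill_salary_with_median(rows):
    """Return copies of the rows with missing salaries replaced by the median."""
    known = [r["salary"] for r in rows if r["salary"] is not None]
    fill = median(known)
    return [{**r, "salary": fill if r["salary"] is None else r["salary"]} for r in rows]


def drop_missing(rows, column):
    """Return only the rows that have a value in the given column."""
    return [r for r in rows if r[column] is not None]


missing = count_missing(records)            # inspect first
filled = fill_salary_with_median(records)   # numeric gap: fill in
cleaned = drop_missing(filled, "department")  # categorical gap: remove the record
```

Filling a numeric column with the median and dropping rows with a missing category is only one reasonable combination; which strategy fits depends on how many values are missing and why.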
Simplifying the dataset by combining categories within columns, where applicable. This step can streamline the analysis and help in drawing more accurate conclusions.
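One possible way to combine categories is to fold rarely occurring values into a single bucket. The Python sketch below illustrates that idea only; the `combine_rare` helper, the `"Other"` label, the threshold, and the sample values are all assumptions, and the project itself may have grouped categories differently in Excel.

```python
from collections import Counter


def combine_rare(values, min_count):
    """Replace categories seen fewer than min_count times with 'Other'."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else "Other" for v in values]


# Hypothetical sample of post names.
posts = ["c5", "c9", "c5", "m6", "c9", "n10", "c5"]
combined = combine_rare(posts, min_count=2)
```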
Identifying potential outliers in the dataset that could skew the analysis. Deciding on the appropriate approach for dealing with these outliers, whether that is removing them, replacing them with more representative values, or leaving them as they are.
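A common rule of thumb for flagging outliers is the interquartile range (IQR) test. The sketch below shows that rule in Python as an illustration; it is not necessarily the method used in this project, and the salary values are invented for the example.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return the (low, high) limits of the IQR rule for the given values."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Hypothetical salaries with one extreme value.
salaries = [38000, 42000, 45000, 47000, 51000, 55000, 400000]

low, high = iqr_bounds(salaries)
outliers = [s for s in salaries if s < low or s > high]
kept = [s for s in salaries if low <= s <= high]
```

Whether the flagged values are then removed, replaced, or kept remains a judgment call, as described above.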
After cleaning and preparing the data, summarizing the findings: calculating relevant statistical measures such as averages and medians, and using visualizations such as charts and graphs to make the data easier to understand.
To tackle the project, follow these data analytics tasks using Excel:
Determining the gender distribution of hires. Calculating the number of males and females that have been hired by the company.
Calculating the average salary offered by the company using Excel functions.
Creating class intervals for the salaries in the company. This will help in visualizing and understanding the distribution of salaries.
Utilizing a suitable visualization, such as a pie chart or bar graph, to represent the proportion of employees working in different departments.
Using a chart or graph to illustrate the distribution of different position tiers within the company.
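The tasks above were carried out in Excel. For readers who prefer code, the first three (gender counts, average salary, and salary class intervals) can be sketched in Python as follows. The field names, the gender labels, the 20,000 interval width, and the sample rows are assumptions chosen for illustration, not values from the project dataset.

```python
from collections import Counter
from statistics import mean

# Hypothetical sample of hired candidates.
hires = [
    {"gender": "Male", "salary": 35000},
    {"gender": "Female", "salary": 62000},
    {"gender": "Male", "salary": 48000},
    {"gender": "Prefer not to say", "salary": 71000},
    {"gender": "Female", "salary": 29000},
]

# Gender distribution: number of hires per gender label.
gender_counts = Counter(h["gender"] for h in hires)

# Average salary offered.
average_salary = mean(h["salary"] for h in hires)


def class_intervals(values, width):
    """Count values per fixed-width interval, keyed by an 'a-b' label."""
    bins = Counter((v // width) * width for v in values)
    return {f"{start}-{start + width - 1}": bins[start] for start in sorted(bins)}


salary_bins = class_intervals([h["salary"] for h in hires], width=20000)
```

The resulting `salary_bins` mapping is the kind of table that would feed a histogram or bar chart of the salary distribution.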
- Most hired individuals are male; 6% preferred not to state their gender.
- Average total salary offered, excluding outliers: $49,878.
- Average total salary offered, including outliers: $49,976.
- 2,892 of the 4,697 hired employees had a salary below $60,000.
- 1,802 of the 4,697 hired employees had a salary above $60,000 and below $100,000.
- The Operations department had the most hires with 1,843, followed by the Service department with 1,332.
- The same department trend holds for non-hired individuals as well.
- Posts c5 and c9 had the most hires, the majority of them male.
- Posts m6, m7, n6, n9, and n10 were almost nonexistent.
- Most employees in posts c5 and c9 had a salary below $40,000.
- Employees in posts c5 and c9 mostly belonged to the Operations department.
- The Service department followed the same trends.
This project is licensed under the MIT License.
- ©2023 Tushar Aggarwal. All rights reserved
- Medium
- Tushar-Aggarwal.com
- New Kaggle
If you have any questions, suggestions, or just want to say hello, you can reach out to us at Tushar Aggarwal. We would love to hear from you!