This repository presents an end-to-end analysis of the Jobs section of LinkedIn, the professional networking platform. The project extracts the details of over 500 job listings from LinkedIn's website using the Python library Selenium and organizes the information into three tables. The data was then cleaned and explored using the Pandas and NumPy libraries, MySQL and MS Excel were used to derive insights from the dataset, and Power BI was used to visualize the results. Finally, a web page was created to present relevant job information based on the skills listed in the dataset, and it was hosted on the cloud for wider access.
Files | Description |
---|---|
Data-Cleaning | This folder contains files and scripts specifically dedicated to the process of cleaning raw data |
Data-Scrapping | This folder contains files and scripts designed to extract data from LinkedIn's Jobs section |
Data-Visualization | This folder contains files and scripts that focus on creating visual representations of data in PowerBI |
Dataset | The dataset folder serves as a central location for storing and managing the data used in the project |
Exploratory Data Analysis | This folder contains files and scripts dedicated to exploring the patterns present in a given dataset |
Web-Page | This folder contains files and resources related to the development of a web page or website |
README.md | This is the readme file of this repository |
-
Implemented web scraping on the LinkedIn jobs section using the Python library Selenium. Leveraged its capabilities to extract and retrieve the following attributes from the job listings:
Attribute | Feature's Meaning |
---|---|
location | The location of the job |
designation | The designation of the job |
name | Name of the company |
industry | Industry in which the company operates |
employees_count | Count of employees |
linkedin_followers | Number of followers on LinkedIn |
involvement | The nature of involvement in the job, for instance Full-time or Part-time |
level | The seniority level, e.g. Mid-Senior level |
total_applicants | Total number of applicants |
Skills | Skills required for the job |
-
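A minimal sketch of this scraping step is shown below. It uses Selenium's standard `find_elements` API, but the search URL, the CSS selectors, and the output file name are illustrative assumptions; the actual selectors live in the Data-Scrapping folder, and LinkedIn's markup changes frequently.

```python
# Illustrative sketch only: the URL, CSS selectors, and output file name are
# assumptions, not the exact ones used in the Data-Scrapping scripts.
import time

import pandas as pd
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
driver.get("https://www.linkedin.com/jobs/search/?keywords=data%20analyst")
time.sleep(5)  # crude wait for the job cards to render

records = []
for card in driver.find_elements(By.CSS_SELECTOR, "div.base-card"):  # assumed selector
    records.append({
        "designation": card.find_element(By.CSS_SELECTOR, "h3").text,
        "name": card.find_element(By.CSS_SELECTOR, "h4").text,  # company name
        "location": card.find_element(By.CSS_SELECTOR, ".job-search-card__location").text,
    })

driver.quit()
pd.DataFrame(records).to_csv("job_details.csv", index=False)
```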
Utilized Pandas to perform data cleaning and exploratory data analysis (EDA), then imported the cleaned CSV files into MySQL for further analysis and insight extraction.
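A minimal sketch of this clean-and-load step is shown below, assuming the scraped CSV from the previous step and a local MySQL instance; the file name, credentials, and table name are placeholders, and the cleaning shown is only a representative subset.

```python
# Sketch of the clean-and-load step; file name, credentials, and table name
# are placeholders, and the cleaning shown is only a representative subset.
import pandas as pd
from sqlalchemy import create_engine

df = pd.read_csv("job_details.csv")

# Basic cleaning: drop duplicate rows and turn the follower count into a number
df = df.drop_duplicates()
df["linkedin_followers"] = (
    df["linkedin_followers"]
    .astype(str)
    .str.replace(",", "", regex=False)
    .str.extract(r"(\d+)", expand=False)
    .astype(float)
)

# Load the cleaned table into MySQL for SQL-based analysis
engine = create_engine("mysql+pymysql://user:password@localhost:3306/linkedin_jobs")
df.to_sql("jobs", con=engine, if_exists="replace", index=False)
```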
-
Employed SQL clauses such as GROUP BY, ORDER BY, HAVING, and CASE WHEN to manipulate the data, enabling in-depth analysis and extraction of valuable insights.
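As an illustration, a representative query of this kind is shown below, run from Python against the `jobs` table loaded earlier; the employee-count thresholds and company-size labels are assumptions, not the project's exact analysis.

```python
# Representative query combining GROUP BY, HAVING, ORDER BY, and CASE WHEN;
# the size thresholds and bucket labels are illustrative assumptions.
import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("mysql+pymysql://user:password@localhost:3306/linkedin_jobs")

query = """
SELECT
    industry,
    CASE
        WHEN employees_count < 200  THEN 'Small'
        WHEN employees_count < 1000 THEN 'Medium'
        ELSE 'Large'
    END AS company_size,
    COUNT(*) AS openings,
    AVG(total_applicants) AS avg_applicants
FROM jobs
GROUP BY industry, company_size
HAVING COUNT(*) > 5
ORDER BY openings DESC
"""

print(pd.read_sql(query, engine))
```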
-
Developed a dynamic web page showcasing job listings categorized by distinct skill sets, enhancing accessibility and user experience.
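The actual implementation lives in the Web-Page folder; purely as an illustration of the idea, a minimal Flask sketch that filters listings by skill could look like the following, where the route, template, and CSV path are assumptions.

```python
# Minimal illustrative sketch of a skill-filtered listings page using Flask;
# the route, template, and data source are assumptions, not the project's code.
import pandas as pd
from flask import Flask, render_template_string

app = Flask(__name__)
jobs = pd.read_csv("job_details.csv")

TEMPLATE = """
<h1>Jobs requiring {{ skill }}</h1>
<ul>
  {% for j in listings %}<li>{{ j.designation }} at {{ j.name }} ({{ j.location }})</li>{% endfor %}
</ul>
"""

@app.route("/skill/<skill>")
def jobs_by_skill(skill):
    # Keep rows whose Skills field mentions the requested skill
    matches = jobs[jobs["Skills"].str.contains(skill, case=False, na=False)]
    return render_template_string(TEMPLATE, skill=skill, listings=list(matches.itertuples()))

if __name__ == "__main__":
    app.run(debug=True)
```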
-
The visual provides an overview of job opportunities that specifically require a particular skill set. It showcases the correlation between specific skills and the corresponding job positions available in the market. The visual aims to highlight the demand for expertise in certain areas and serves as a valuable resource for individuals seeking employment opportunities aligned with their skill sets.
```python
import matplotlib.pyplot as plt
import seaborn as sns

# Bar chart: number of job postings that ask for each skill
plt.figure(figsize=(20, 10), dpi=100)
chart = sns.barplot(x=skill_df["Skill"], y=skill_df["Number of jobs requiring the skill"], palette="icefire")
plt.title("Number of jobs that require a particular skill", size=25)
plt.ylabel("Number of jobs", size=22)
plt.xlabel("Skill", size=22)
plt.xticks(fontsize=18, rotation=90)
plt.yticks(fontsize=18)
chart.bar_label(chart.containers[0], size=18)
plt.show()
```
-
The visual displays the distribution of job availability across different classes, providing insights into the number of job opportunities within each category. It offers a clear understanding of the relative abundance or scarcity of jobs in various classes, aiding individuals in identifying areas with higher or lower job prospects.
```python
# Count plot: number of job openings within each class
plt.figure(figsize=(20, 10), dpi=100)
ax = sns.countplot(x=df["Class"], palette="viridis")
plt.title("Job Openings by Class", size=25)
plt.ylabel("Number of job openings", size=20)
plt.xlabel("Class", size=20)
plt.xticks(fontsize=20)
plt.yticks(fontsize=20)
ax.bar_label(ax.containers[0], size=20)
plt.show()
```
-
This visual highlights job openings categorized based on their involvement nature. It provides insights into the levels of engagement and responsibility required for various positions, helping individuals identify roles that align with their desired involvement level.
```python
# Count plot: openings by involvement nature (Full-time, Part-time, etc.)
plt.figure(figsize=(13, 6), dpi=200)
chart = sns.countplot(x=df["Involvement"], palette="ocean_r")
plt.title("Job Openings for different involvement natures", size=25)
plt.ylabel("Number of job openings", size=20)
plt.xlabel("Involvement nature", size=20)
chart.bar_label(chart.containers[0], size=13)
plt.show()
```
An interactive Power BI dashboard has been developed to consolidate data from multiple sources. It showcases visually engaging charts, graphs, and tables, enabling users to explore key metrics and extract valuable insights. The dashboard enhances data-driven decision-making and facilitates effective communication with stakeholders.
- Most jobs are posted by small-sized companies, but applicants are apparently applying to large-sized companies.
- Most applicants apply to Chennai-based companies, but more jobs are available in Mumbai- and Bangalore-based companies.
- The number of applicants is significantly higher in large and small companies than in medium-sized companies.
- The top three states with the most job openings are Tamil Nadu, Karnataka, and Maharashtra.
- Digital marketing has the most opportunities.
- About half of the total openings are in the IT industry.
- AI is the most in-demand skill.