Skip to content

Analysis of latent features job descriptions and their correlation with salaries

Notifications You must be signed in to change notification settings

nalsabhan/DS-Jobs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Job Descriptions Analysis

Overview

This project intend to analyze latent features (topics) in job descriptions. It answers questions such as how valuable a skill-set is, how much the value varies across companies and What topics or skills contribute to high salaries. Average salaries per topic has been used to determine the value of topics or skills. Jobs are ranked for each topic allowing to determine the relevance strength of skills for each topic.

Data

The data has been collected from different sources and merged for matching job titles.

Data sources:

The Dataset consist of 15724 jobs for 12 companies and 88334 salaries.

image

Latent Feature example

The generated matrixes from NMF rank jobs per latent feature and words per the same latent feature. This makes each latent feature identifiable by the words it ranks highly. The following are examples of some latent features:

Customer services:

The following image, illustrates the top 400 words and their sizes represent their word rank for the this feature: image

image

The X-axes represent the strength/weight of this latent feature (Customer services) per job, while y-axis is the corresponding average salary. It shows that the more the job description is relevant to customer services the lower the average salaries.

About

Analysis of latent features job descriptions and their correlation with salaries

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published