Employment relations: a data driven analysis of job markets using online job boards and online professional networks


Career websites contain valuable data on employees, their skill sets, and their employment history. This article applies k-means clustering to keywords describing skill sets, after the keywords have been transformed using either

term frequency inverse document frequency (TF-IDF) or
t-distributed stochastic neighbor embedding (t-SNE).
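
The TF-IDF variant of this pipeline can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the `profiles` data, the smoothed IDF formula, and the deterministic farthest-point seeding of k-means are all assumptions made for the example, and the t-SNE variant is not covered.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return (vocabulary, vectors) with L2-normalised TF-IDF weights."""
    vocab = sorted({term for doc in docs for term in doc})
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumption): common terms keep a small positive weight.
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vocab, vectors

def dist2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iterations=20):
    """Lloyd's algorithm with deterministic farthest-point seeding."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next seed: the point farthest from all seeds chosen so far.
        centroids.append(
            max(points, key=lambda p: min(dist2(p, c) for c in centroids))
        )
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda j: dist2(p, centroids[j])) for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Hypothetical skill-keyword lists, one per profile (not data from the article).
profiles = [
    ["html", "css", "javascript"],
    ["javascript", "css", "react"],
    ["sql", "etl", "reporting"],
    ["sql", "reporting", "dashboards"],
]
vocab, vectors = tfidf(profiles)
labels = kmeans(vectors, k=2)
print(labels)
```

On this toy input the two web-oriented profiles fall into one cluster and the two reporting-oriented profiles into the other, because profiles that share keywords have nearby TF-IDF vectors.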

A third experiment performs the clustering after 20 keywords have been selected using Latent Dirichlet Allocation.
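
As a rough illustration of LDA-based keyword selection, the sketch below fits LDA with collapsed Gibbs sampling and then picks keywords from the resulting topics. The summary does not say how the 20 keywords are derived from the topics, so the selection rule here (rank each word by its highest probability in any topic) is an assumption, as are the `profiles` data, the hyperparameters `alpha` and `beta`, and the function names. The toy example selects 3 keywords rather than 20.

```python
import random
from collections import Counter

def lda_topic_words(docs, n_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns P(word | topic) per topic."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    n_words = len(vocab)
    # Random initial topic for every token, plus the count tables Gibbs needs.
    assign = [[rng.randrange(n_topics) for _ in doc] for doc in docs]
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [Counter() for _ in range(n_topics)]
    topic_total = [0] * n_topics
    for d, doc in enumerate(docs):
        for w, t in zip(doc, assign[d]):
            doc_topic[d][t] += 1
            topic_word[t][w] += 1
            topic_total[t] += 1
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                t = assign[d][i]
                # Remove the token, then resample its topic from the conditional.
                doc_topic[d][t] -= 1
                topic_word[t][w] -= 1
                topic_total[t] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta) / (topic_total[k] + n_words * beta)
                    for k in range(n_topics)
                ]
                t = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = t
                doc_topic[d][t] += 1
                topic_word[t][w] += 1
                topic_total[t] += 1
    return [
        {w: (topic_word[k][w] + beta) / (topic_total[k] + n_words * beta)
         for w in vocab}
        for k in range(n_topics)
    ]

def select_keywords(topics, n_keywords):
    """Assumed rule: rank words by their highest probability in any topic."""
    best = {w: max(topic[w] for topic in topics) for w in topics[0]}
    return sorted(best, key=lambda w: (-best[w], w))[:n_keywords]

# Hypothetical skill-keyword lists, one per profile (not data from the article).
profiles = [
    ["html", "css", "javascript"],
    ["javascript", "css", "react"],
    ["sql", "etl", "reporting"],
    ["sql", "reporting", "dashboards"],
]
topics = lda_topic_words(profiles, n_topics=2)
keywords = select_keywords(topics, n_keywords=3)
print(keywords)
```

The selected keywords would then replace the full vocabulary as the feature set for clustering. Fixing `seed` makes the sampler reproducible; different seeds can yield different topics on such a small corpus.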

In addition, the authors extract the chronological information about positions to visualize potential career paths.

Insights

clustering jobs by title yields the job titles commonly used for different kinds of work (e.g., web developer, business intelligence, Oracle development)
the network generated from the chronological information (i) shows typical career paths and (ii) identifies positions with high in- and out-degrees
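
The career-path network can be pictured as a directed graph whose edges connect consecutive positions in a person's history. The sketch below is one possible construction under stated assumptions: the `histories` data is invented, edges are weighted by how many times a transition occurs, and degrees are computed as sums of those weights (the summary does not say whether the authors count transitions or distinct neighbors).

```python
from collections import Counter

def career_graph(histories):
    """Count transitions between consecutive positions in each history."""
    edges = Counter()
    for positions in histories:
        for src, dst in zip(positions, positions[1:]):
            if src != dst:  # ignore a repeated title, which is not a move
                edges[(src, dst)] += 1
    return edges

def degrees(edges):
    """Weighted in- and out-degree per position (assumed weighting)."""
    in_deg, out_deg = Counter(), Counter()
    for (src, dst), weight in edges.items():
        out_deg[src] += weight
        in_deg[dst] += weight
    return in_deg, out_deg

# Hypothetical position sequences in chronological order, one per person.
histories = [
    ["junior developer", "web developer", "team lead"],
    ["junior developer", "web developer", "software architect"],
    ["analyst", "business intelligence developer", "team lead"],
]
edges = career_graph(histories)
in_deg, out_deg = degrees(edges)

# Frequent edges suggest typical career steps.
for (src, dst), weight in edges.most_common(3):
    print(f"{src} -> {dst}: {weight}")
```

In this reading, a position with a high in-degree is a common destination, and one with a high out-degree is a common stepping stone to other roles.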


Albert Weichselbraun
