---
output: github_document
---
[![CRAN status](https://www.r-pkg.org/badges/version/DSjobtracker)](https://CRAN.R-project.org/package=DSjobtracker)
# DSjobtracker
<img src="man/figures/hexsticker.png" align="right" height="200"/>
## What skills and qualifications are required for data science related jobs?
The DSjobtracker package compiles data from 1172 job advertisements in data science and statistics, making it a useful resource for learning which skills these fields require. The word cloud below shows 97 skills requested in the advertisements and highlights the most sought-after ones.
```{r, echo=FALSE, warning=FALSE, message=FALSE}
library(ggwordcloud)
library(DSjobtracker)
# Skill columns of DStidy; underscores become spaces for display
word <- colnames(DStidy[, 7:94])
word <- gsub("_", " ", word)
# Column sums give each skill's frequency across the advertisements
frequency <- colSums(DStidy[, 7:94], na.rm = TRUE)
df <- data.frame(word = word,
                 freq = frequency)
ggplot(data = df,
       aes(label = word, size = freq, col = as.character(freq))) +
  geom_text_wordcloud(rm_outside = TRUE, max_steps = 1,
                      grid_size = 1, eccentricity = .9) +
  scale_size_area(max_size = 20) +
  # scale_color_brewer(palette = "Paired", direction = -1) +
  theme_void()
```
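The same frequencies that size the words can also be read as a ranked table. The following is a minimal sketch, assuming the skill columns are 0/1 indicators (as the `colSums()` call suggests). The `toy` data frame and its values are hypothetical stand-ins for the skill columns of `DStidy`, so the block runs without the package installed.

```r
# Hypothetical stand-in for the skill columns of DStidy:
# one row per advertisement, 1 if the skill is mentioned, 0 if not.
toy <- data.frame(
  R        = c(1, 1, 0, 1, NA),
  Python   = c(1, 1, 1, 1, 0),
  SQL      = c(0, 0, 1, 0, 0),
  Big_Data = c(0, 1, 0, 1, 0)
)

# Count mentions per skill, ignoring missing values.
skill_freq <- colSums(toy, na.rm = TRUE)

# Build a table with readable names and sort it from most to least requested.
skill_rank <- data.frame(
  word = gsub("_", " ", names(skill_freq)),
  freq = as.numeric(skill_freq)
)
skill_rank <- skill_rank[order(-skill_rank$freq), ]
skill_rank
```

With the real data, replacing `toy` with the skill columns of `DStidy` should give the ranking behind the word cloud, provided those columns are coded the same way.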
# Installation
You can install the **development** version from [GitHub](https://github.com/thiyangt/DSjobtracker):
```r
#install.packages("devtools")
devtools::install_github("thiyangt/DSjobtracker")
library(DSjobtracker)
```
```{r, echo=FALSE}
library(DSjobtracker)
```
# Glimpse of tidy data
```{r, comment=NA}
tibble::glimpse(DStidy)
```
# 2021 survey data
```{r, comment=NA}
df2021 <- get_data(2021)
df2021
```
# Preview of the tidy version of the dataset
```{r, comment=NA}
head(DStidy)
```
# Preview of the untidy version of the dataset
```{r, comment=NA}
head(DSraw)
```