-
Notifications
You must be signed in to change notification settings - Fork 3
/
DESCRIPTION
45 lines (45 loc) · 1.65 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
Type: Package
Package: tidylo
Title: Weighted Tidy Log Odds Ratio
Version: 0.2.0.9000
Authors@R: c(
person("Tyler", "Schnoebelen", , "tjs1976@gmail.com", role = "aut"),
person("Julia", "Silge", , "julia.silge@gmail.com", role = c("aut", "cre", "cph"),
comment = c(ORCID = "0000-0002-3671-836X")),
person("Alex", "Hayes", , "alexpghayes@gmail.com", role = "aut",
comment = c(ORCID = "0000-0002-4985-5160"))
)
Description: How can we measure how the usage or frequency of some
feature, such as words, differs across some group or set, such as
documents? One option is to use the log odds ratio, but the log odds
ratio alone does not account for sampling variability; we haven't
counted every feature the same number of times so how do we know which
differences are meaningful? Enter the weighted log odds, which
'tidylo' provides an implementation for, using tidy data principles.
In particular, here we use the method outlined in Monroe, Colaresi,
and Quinn (2008) <doi:10.1093/pan/mpn018> to weight the log odds ratio
by a prior. By default, the prior is estimated from the data itself,
an empirical Bayes approach, but an uninformative prior is also
available.
License: MIT + file LICENSE
URL: https://juliasilge.github.io/tidylo/,
https://github.com/juliasilge/tidylo
BugReports: https://github.com/juliasilge/tidylo/issues
Imports:
dplyr,
rlang
Suggests:
covr,
ggplot2,
janeaustenr,
knitr,
rmarkdown,
stringr,
testthat (>= 3.0.0),
tidytext
VignetteBuilder:
knitr
Config/testthat/edition: 3
Encoding: UTF-8
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.2