Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trouble with definition of "D" in val.prob. #146

Open
jthaman opened this issue Jun 19, 2024 · 1 comment
Open

Trouble with definition of "D" in val.prob. #146

jthaman opened this issue Jun 19, 2024 · 1 comment

Comments

@jthaman
Copy link

jthaman commented Jun 19, 2024

I'm trying to understand what "D" means in the val.prob function. I read in your technical report that.

" The index of discrimination is derived by computing the difference in quality of the best constant predictor (one that on the average correctly predicts the overall prevalence of the event) and the best calibrated predictor:

D = [L(a, 0) - L (a,b) - 1] / n".

I have tried to code this up myself in R, but i can't match your output from val.prob.

Here is an example that i coded up :

# example data
y.test <- c(0, 1, 0, 0, 1, 1)
p.sim <- c(0.5, 0.8, 0.1, 0.2, 0.4, 0.7)
n <- length(y.test)

cal.fit <- glm(y.test ~ qlogis(p.sim), family = binomial)
Lab <- -2 * logLik(cal.fit); Lab

cal.fit01 <- glm(y.test ~ 0, data = dat, family = binomial, offset = qlogis(p.sim))
L01 <- -2 * logLik(cal.fit01); L01

cal.fita0 <- glm(y.test ~ 1, data = dat, family = binomial)
La0 <- -2 * logLik(cal.fita0); La0

cal.fita1 <- glm(y.test ~ 1, data = dat, family = binomial, offset = qlogis(p.sim))
La1 <- -2 * logLik(cal.fita1); La1

U <- (L01 - Lab - 2) / n; U # matches val.prob
Up <- (L01 - La1 - 1) / n; Up # not used in val.prob
Us <- (La1 - Lab - 1) / n; Us # not used in val.prob

D.paper <- (La0 - Lab - 1) / n; D.paper # does not match
D.code <- (La0 - L01 - 1) / n; D.code # matches val.prob

val.prob(p.sim, y.test)

D.code is D as best I can tell from reading the source code of val.prob.s, which is (I believe) that same as [L(a, 0) - L (0, 1) - 1] / n". D.paper is the D in the technical report. So, which D is the real D? Thanks! i think this stuff is so cool.

@harrelfe
Copy link
Owner

Thanks for working on this and sorry to take so long in replying. I’m sorry I don’t have time to dig into this. If you get time to examine the val.prob code line by line and comparing with what you are doing, it may shed some light, and may possible lead to a specify bug correction I need to make.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants