WIP: Energy score estimators #29

Open · wants to merge 14 commits into main
Conversation

simon-hirsch
Contributor

Hi @sallen12, this is for #25. It's an early-stage work in progress, but I'm happy to get early feedback.

The gufuncs look good in my opinion; the backend-based functions are a bit harder. For numpy, it's quite straightforward to use np.random.permutation or to subset by a sampled index. For the other backends, I'm not sure what the appropriate way is to shuffle the ensemble members once. Do you have an idea here?
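For illustration, a minimal sketch of what I mean for numpy and torch (the (observations, members) layout and the variable names are just placeholders, not the actual scoringrules arrays):

    import numpy as np
    import torch

    # Toy forecast arrays; the (observations, members) layout is assumed for illustration.
    fct_np = np.random.randn(100, 21)
    fct_th = torch.randn(100, 21)

    # numpy: subset by a sampled index (one permutation applied to all rows)
    idx = np.random.permutation(fct_np.shape[-1])
    fct_np_shuffled = fct_np[..., idx]

    # torch: the analogous index-based approach via randperm
    idx_th = torch.randperm(fct_th.shape[-1])
    fct_th_shuffled = fct_th[..., idx_th]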

@frazane
Owner

frazane commented May 31, 2024

I would put the permutation under the public API function so it's more transparent, and since it can be quite expensive I would also give the option to skip it. Also, I think using np.random.shuffle (with the transposed array if we want it applied to the last dimension) should be more performant because it operates in place without copies, but that's worth testing with a micro-benchmark.
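Something like this rough micro-benchmark sketch could settle it (the array sizes and layout are made up for illustration):

    import timeit
    import numpy as np

    fct = np.random.randn(10_000, 51)  # (observations, ensemble members), sizes made up

    def by_permutation():
        # fancy indexing with a sampled index returns a shuffled copy
        idx = np.random.permutation(fct.shape[-1])
        return fct[..., idx]

    def by_shuffle():
        # in place on the transposed view, no copy of the data buffer
        np.random.shuffle(fct.T)
        return fct

    print("permutation:", timeit.timeit(by_permutation, number=100))
    print("shuffle:    ", timeit.timeit(by_shuffle, number=100))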

We need to add shuffle to the backends 👍

@simon-hirsch
Contributor Author

simon-hirsch commented Jun 3, 2024

Adding it to the backend seems like a good option. I'll have a look at shuffle vs. permutation, and also at how torch and tf implement these. We could also have something like:

B.shuffle(array, axis, inplace=False)

so that we avoid any confusion between in-place and not-in-place operations.
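As a rough sketch of what that could look like for the numpy backend (the class and method names here are illustrative, not the actual backend API):

    import numpy as np

    class NumpyBackend:
        def shuffle(self, array, axis=-1, inplace=False):
            # Apply one permutation along `axis` (the same reordering for every slice).
            if inplace:
                # legacy np.random.shuffle works in place along the first axis,
                # so move the target axis to the front via a view
                np.random.shuffle(np.moveaxis(array, axis, 0))
                return array
            # otherwise return a shuffled copy via index subsetting
            idx = np.random.permutation(array.shape[axis])
            return np.take(array, idx, axis=axis)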

Won't find time this week, but the week after 👍🏽

@simon-hirsch
Contributor Author

So, roll is quite straightforward, but shuffle is actually tricky because we need to think about setting a seed.

We can set the seed in the function call, e.g. shuffle(x, axis, seed=123), or somewhere in the backend, e.g. as self.seed = 123, and then use it from there. I'm slightly inclined to have it in the function call (it's more explicit, and usually you evaluate forecasts only once), but it would get tricky once a function is called more than once. @frazane, what do you think?

I acknowledge that the current version is somewhat inconsistent because I did jax last; all the other packages allowed me to ignore the issue in this first draft :D
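For example, a sketch of the seed-in-the-call option for numpy and jax (function names are illustrative only; jax is where it matters because it needs an explicit key):

    import numpy as np
    import jax

    def numpy_shuffle(x, axis=-1, seed=None):
        # seed feeds a fresh Generator, so repeated calls with the same seed agree
        rng = np.random.default_rng(seed)
        idx = rng.permutation(x.shape[axis])
        return np.take(x, idx, axis=axis)

    def jax_shuffle(x, axis=-1, seed=0):
        # jax has no global state, so the seed is turned into a PRNG key
        key = jax.random.PRNGKey(seed)
        return jax.random.permutation(key, x, axis=axis)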

@frazane
Owner

frazane commented Jun 18, 2024

I would also be in favor of having the seed in the function call.

@simon-hirsch
Contributor Author

Hi there, here we go:

  • Move setting the seed to backend.shuffle(..., seed)
  • Adjust the backend and gufunc scoring rules
  • Add a few sentences to the docs
  • Add a simple test that asserts that the iid estimator is <10% worse than the full-sample estimator and that the k-band estimator with $k=100$ is less than 5% worse than the full-sample estimator (a sketch is below this list). I admit that these bounds are made up on the fly.
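A minimal sketch of that test (the energy_score argument order and the estimator keyword are assumptions about the API proposed here, not the final interface; the tolerance mirrors the 10% bound above):

    import numpy as np
    import scoringrules as sr

    def test_iid_estimator_close_to_full_sample():
        rng = np.random.default_rng(42)
        obs = rng.normal(size=(500, 3))        # (observations, variables), made-up data
        fct = rng.normal(size=(500, 100, 3))   # (observations, members, variables)

        # argument order and the `estimator` keyword are assumptions, not the final API
        es_full = np.mean(sr.energy_score(obs, fct))
        es_iid = np.mean(sr.energy_score(obs, fct, estimator="iid"))

        # made-up tolerance: iid estimator within 10% of the full-sample estimator
        assert abs(es_iid - es_full) / es_full < 0.10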

I think we're ready to roll - feedback and criticism welcome.

PS: Actually, the last point got me thinking. At some point, showing the speed vs. estimation-precision trade-off might be an interesting case study for an example notebook.

@simon-hirsch
Contributor Author

Hi @sallen12 @frazane, any feedback on this? :)
