-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Filling histograms twice fails in 2023.9.1 only #363
Comments
@agoose77 since you worked on the pickle stuff I'm going ahead and pinging you here :) I'll still take a look at investigating at some point today |
@douglasdavis @agoose77 Also I would like to note that I've been encountering some strange behavior in 2023.9.1 in general. events = NanoEventsFactory.from_root(
{"root_files/Egamma1.root": "Events"},
permit_dask=True,
chunks_per_file=10,
).events()
def filter_events(events, pt):
good_events = events[dak.num(events.Electron) >= 2]
abs_eta = abs(events.Electron.eta)
pass_eta_ebeegap = (abs_eta < 1.4442) | (abs_eta > 1.566)
pass_tight_id = events.Electron.cutBased == 4
pass_pt = events.Electron.pt > pt
pass_eta = abs_eta <= 2.5
pass_selection = pass_pt & pass_eta & pass_eta_ebeegap & pass_tight_id
n_of_tags = dak.sum(pass_selection, axis=1)
good_events = events[n_of_tags >= 2]
good_locations = pass_selection[n_of_tags >= 2]
return good_events
client = Client()
x = filter_events(events, 31)
dask.compute(x) runs into some weird
only on some root files, while on others it does not. All this disappears in 2023.9.0 |
At the highest level it looks like the pickle stuff (made it into 2023.9.0) combined with the turning back on of form rehydration (made it in to 2023.9.1) is leading to an error associated with attempting to serialize a PlaceholderArray. I was able to see this without histogramming: import uproot
import dask_awkward as dak
events = uproot.dask({"/path/to/coffea/tests/samples/nano_dy.root": "Events"})
p1 = events.Electron_pt[:20]
p2 = events.Electron_pt[20:]
h.compute(p1, p2, scheduler="processes") |
Closing this one as fixed by #366 and scikit-hep/awkward#2714 |
This fails with
Client()
or"processes"
scheduler .With
"processes"
it gives:while with
Client()
it gives:The text was updated successfully, but these errors were encountered: