Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DNN Indexing Bug Fix + Normalization Improvements #100

Open
wants to merge 5 commits into
base: CMSSW_14_1_0_pre3_LST_X_LSTCore_realfiles_batch1_devel
Choose a base branch
from

Conversation

GNiendorf
Copy link
Member

@GNiendorf GNiendorf commented Sep 30, 2024

Fixes a bug related to the hit indexing in the DNN train script. Also adds other hits to write_lst_ntuple. Will add more to this PR soon.

@GNiendorf
Copy link
Member Author

/run all

Copy link

github-actions bot commented Oct 1, 2024

The PR was built and ran successfully in standalone mode. Here are some of the comparison plots.

Efficiency vs pT comparison Efficiency vs eta comparison
Fake rate vs pT comparison Fake rate vs eta comparison
Duplicate rate vs pT comparison Duplicate rate vs eta comparison

The full set of validation and comparison plots can be found here.

Here is a timing comparison:

   Evt    Hits       MD       LS      T3       T5       pLS       pT5      pT3      TC       Reset    Event     Short             Rate
   avg     44.2    324.5    115.0     46.6     91.8    501.5    111.6    149.9    100.4      2.2    1487.7     942.1+/- 246.4     406.7   explicit_cache[s=4] (target branch)
   avg     43.7    324.0    115.8     46.9     90.2    549.8    112.2    150.2    106.6      2.0    1541.2     947.7+/- 247.1     421.3   explicit_cache[s=4] (this PR)

@GNiendorf
Copy link
Member Author

No timing changes, see below on GPU.

Current Timing
Screenshot 2024-09-30 at 10 19 05 PM

This PR
Screenshot 2024-09-30 at 10 16 47 PM

@GNiendorf
Copy link
Member Author

GNiendorf commented Oct 1, 2024

Efficiency increases slightly for pT5's but no huge changes. Fake rate decreases in transition region, increases in barrel. Note: Plots below exaggerate the fake rate increase in the barrel because the y axis changes.

The AUC of the model increases significantly (from .88 -> .91) from fixing the bug but this doesn't translate to significantly better performance. Tracks in the transition region become easier to reject than tracks in the barrel. It seems like the eta value of the hit shared between the two T3's offers a lot of rejection power in that region using radius matching (inner vs. bridge, outer vs. bridge)

Overall, I don't think there is a large difference in physics performance or timing from fixing the bug. If the fake rate increase in the barrel is a concern, the easiest solution would be to choose a tighter cut in the barrel to match the performance of the current DNN in that region.

Copy link

github-actions bot commented Oct 1, 2024

The PR was built and ran successfully with CMSSW. Here are some plots.

OOTB All Tracks
Efficiency and fake rate vs pT, eta, and phi

The full set of validation and comparison plots can be found here.

@GNiendorf GNiendorf changed the title DNN Indexing Bug Fix DNN Indexing Bug Fix + Normalization Improvements Oct 1, 2024
@GNiendorf GNiendorf marked this pull request as ready for review October 6, 2024 02:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants