Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Silent timesync errors when configuration is inconsistent #354

Open
roland-sipos opened this issue Oct 30, 2024 · 2 comments
Open

Silent timesync errors when configuration is inconsistent #354

roland-sipos opened this issue Oct 30, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@roland-sipos
Copy link
Member

  • The trigger IF not receiving timesyncs (despite configured to do so in the RandomTCMaker, will silently log gazzilions of ERRORs (not visible on opmon) and send absurd windows.

    • Proper error handling is needed here. The trigger should send these errors to Kafka.
  • The RandomTC maker can also switch to system clock based TS-ing, instead of timesync. But when we attempted to STOP, the trigger app crashed. (What I saw is that some threads were not properly joined/stopped, but destroyed. )

    • We should check if the stop scenario is fine with either timesync and sysclock based decisions.

We encountered these errors when config issues and incosistencies were present on EHN1 DAQ configs.
To be cross-checked if the stop logic is intact with sysclock based estimation and failed timesync reception should send errors to Kafka/Opmon

@ArturSztuc
Copy link
Contributor

The first issue is linked to this one in iomanager: DUNE-DAQ/iomanager#48

@ArturSztuc
Copy link
Contributor

The second issue is resolved via this PR: #367

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants