Add reboot and re-test to Testflinger job for DSS validation (New) #1634

motjuste · 2024-12-03T16:43:51Z

Description

This PR only changes the job definition template that the GitHub workflow uses to run the DSS validations on machines from Testflinger. The job now reboots the machine and re-runs the validations, with a delay in between to let things settle down.

Resolved issues

CHECKBOX-1669

Documentation

There are no changes to the documentation.

Tests

The relevant DSS workflow run: https://github.com/canonical/checkbox/actions/runs/12143951564

to save ourselves some time...

The job linked above was actually run on a different branch, which combines the commits from this PR and PR #1633. The workflow runs can take a very long time and the changes in the two PRs don't overlap, so it is worth testing them together.

The failures when testing DSS latest/edge are expected due to a reported issue and do not stem from these changes to Checkbox. The one failure when testing DSS latest/stable occurred during provisioning from Testflinger, before the Checkbox validations were even installed.

codecov · 2024-12-05T14:15:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 48.75%. Comparing base (d0c208e) to head (2eda760).
Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1634      +/-   ##
==========================================
- Coverage   48.80%   48.75%   -0.05%     
==========================================
  Files         369      369              
  Lines       40192    40191       -1     
  Branches     6783     6774       -9     
==========================================
- Hits        19616    19596      -20     
- Misses      19857    19889      +32     
+ Partials      719      706      -13

| Flag | Coverage Δ | |
|---|---|---|
| checkbox-ng | `69.11% <ø> (-0.10%)` | ⬇️ |


pieqq

I don't think this is a good approach. You are running the same test plan twice, without any way of checking at what point each of these tests was run.

I am not sure how you plan to store the results. Are they only kept in Testflinger, or stored somewhere else (e.g. C3)?

Moreover, do you really want to re-run the whole test plan after rebooting, or would a subset of jobs be enough?

I would encourage you to write a dedicated test plan that includes a reboot job (this is a noreturn job, see the Checkbox documentation), with separate jobs that depend on it and run afterwards (see siblings in the Checkbox documentation) to check that DSS is still alive and responds as expected to commands.
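A rough sketch of what such a test plan could look like is below. Every id, command and duration in it is a placeholder, not an existing unit in the DSS provider, and `dss status` is only assumed to be a suitable liveness check. The `noreturn` flag and the `siblings` field are the ones described in the Checkbox documentation. Resuming the session after the reboot may need further configuration that is not shown here.

```
# Hypothetical units for illustration only: ids, commands and
# durations are assumptions, not taken from the DSS provider.

id: dss/reboot
plugin: shell
user: root
flags: noreturn
command: reboot
estimated_duration: 3m
_summary: Reboot the machine under test

id: dss/status
plugin: shell
command: dss status
estimated_duration: 30s
_summary: Check that DSS responds to commands
siblings: [
    { "id": "dss/status-after-reboot",
      "_summary": "Check that DSS still responds after a reboot",
      "depends": "dss/reboot" }
    ]

unit: test plan
id: dss-reboot-validation
_name: DSS validation across a reboot
include:
    dss/status
    dss/reboot
    dss/status-after-reboot
```

With this layout each result is tied to a distinct job id, so the report shows whether a check ran before or after the reboot, and only the jobs listed as siblings are repeated instead of the whole test plan.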

motjuste mentioned this pull request Dec 3, 2024

Add DSS integration tests for CPU and NVIDIA GPU (New) #1633

Open

motjuste marked this pull request as ready for review December 4, 2024 09:36

motjuste requested a review from a team as a code owner December 4, 2024 09:36

motjuste requested a review from pieqq December 4, 2024 09:36

reboot machine in testflinger job and re-run DSS validations

5a9b6ac

motjuste force-pushed the CHECKBOX-1669-dss-reboot-testing-using-testflinger branch from 2eda760 to 5a9b6ac Compare December 5, 2024 14:15

pieqq requested changes Dec 6, 2024

motjuste closed this Dec 6, 2024
