fix: 1568 Fixing S3 Connection Building #1580

irishgordo · 2024-10-09T01:47:02Z

fixes s3 connection building, allowing us to wait for a short period of time to allow the secret for longhorn-system to be built on Harvester side

Resovles: fix/1568
See also: #1568

Which issue(s) this PR fixes:

What this PR does / why we need it:

Adds a bit of time before yielding back the backup config -> as Harvester takes a bit more time usually for it to have the secret built in longhorn-system in v1.4.0-rc1

Additional documentation or context

In the video, it is demo'd with an external bare-metal hp dl160 2 node v1.4.0-rc1 cluster that we can indeed replicate the issue outlined in 1568. We see we are moving to fast trying to build the fixture and the secret hasn't yet built on Harvester side in time. But if we give it a bit of a sleep, to allow Harvester to catch up to build the secret before yielding back the spec, everything is good. The video illustrates the points back to back.

s3-connection-building-issue.mp4

* fixes s3 connection building, allowing us to wait for a short period of time to allow the secret for longhorn-system to be built on Harvester side Resovles: fix/1568 See also: harvester#1568

khushboo-rancher · 2024-10-09T20:26:55Z

harvester_e2e_tests/integrations/test_4_vm_backup_restore.py

@@ -121,6 +121,8 @@ def config_backup_target(api_client, conflict_retries, backup_config, wait_timeo
        f'Failed to update backup target to {backup_type} with {config}\n'
        f"API Status({code}): {data}"
    )
+    # sleeping to allow longhorn secret to be built on Harvester side
+    sleep(5)


Do we have any way of checking the secret existence? Can we check that in a loop with some sleep?

Could we loop through checking the status of the backup target by looping through something like api_client.settings.backup_target_test_connection? Will that return true if the secret isn't there?

irishgordo · 2024-10-22T17:57:13Z

I've converted this back to "draft" as the fix will require checking a status on Longhorn that @lanfon72 was mentioning .
Sleeping is a band-aid -> and really this is most noticable on a loadout of:

A seperate located MinIO (S3 Compatible) Instance, running on a VM, "not" on MinIO Operator based Tenant (so no K8s helm chart based install of MinIO) -> located over a 1GB Symetrical Backbone Network and is backed by SSD not NVMe
A seperate Harvester Cluster, running on bare-metal, "that does not run" the MinIO instance -> backed by a 1GB Symetrical Backbone Network and is backed by SSD not NVMe
Running these tests on a seperate host/workstation, that is also backed by a 1GB Symetrical Backbone Network

It's not that this test is "failing" per say due to the test itself it's the hardware + network based setup that is impacting this test.

When running locally or over faster storage + network -> test failures happens less frequently.

fix: 1568 Fixing S3 Connection Building

3dab7e0

* fixes s3 connection building, allowing us to wait for a short period of time to allow the secret for longhorn-system to be built on Harvester side Resovles: fix/1568 See also: harvester#1568

irishgordo requested a review from a team October 9, 2024 01:47

irishgordo mentioned this pull request Oct 9, 2024

[TEST] TestBackupRestore::test_connection[S3] test fails making consecutive tests fail for backup & restore #1568

Open

khushboo-rancher reviewed Oct 9, 2024

View reviewed changes

irishgordo marked this pull request as draft October 22, 2024 17:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: 1568 Fixing S3 Connection Building #1580

fix: 1568 Fixing S3 Connection Building #1580

irishgordo commented Oct 9, 2024

khushboo-rancher Oct 9, 2024

noahgildersleeve Oct 21, 2024

irishgordo commented Oct 22, 2024

fix: 1568 Fixing S3 Connection Building #1580

Are you sure you want to change the base?

fix: 1568 Fixing S3 Connection Building #1580

Conversation

irishgordo commented Oct 9, 2024

Which issue(s) this PR fixes:

What this PR does / why we need it:

Additional documentation or context

khushboo-rancher Oct 9, 2024

Choose a reason for hiding this comment

noahgildersleeve Oct 21, 2024

Choose a reason for hiding this comment

irishgordo commented Oct 22, 2024