Remove support for Partial Gangs (#3621) #79

Triggered via push May 28, 2024 18:36
Status Failure
Total duration 13s

pages.yml

on: push
deploy-gh-pages (3s)

Annotations

6 errors
deploy-gh-pages
Process completed with exit code 1.
priorityclass_default: testsuite/testcases/basic/priorityclass_default.yaml#L1
unexpected event for job 01j0sqgtf4d003tqvfqdqaz4ya: expected event of type *api.EventMessage_Succeeded, but got &EventMessage_Failed{Failed:&JobFailedEvent{JobId:01j0sqgtf4d003tqvfqdqaz4ya,JobSetId:priorityclass_default-aDrqSWDHrMVbuspYFBNMtU,Queue:e2e-test-queue,Created:2024-06-20 02:28:23.879423875 +0000 UTC,ClusterId:Cluster1,Reason:Container sleep failed with exit code 128 because StartError: failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: container init was OOM-killed (memory limit too low?): unknown ,ExitCodes:map[string]int32{},KubernetesId:553a7fdd-71e5-425d-b1a6-a4a554854fc7,NodeName:armada-test-worker,PodNumber:0,ContainerStatuses:[]*ContainerStatus{&ContainerStatus{Name:sleep,ExitCode:128,Message:failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: container init was OOM-killed (memory limit too low?): unknown,Reason:StartError,Cause:Error,},},Cause:Error,PodName:armada-01j0sqgtf4d003tqvfqdqaz4ya-0,PodNamespace:personal-anonymous,},}: context canceled
gang: testsuite/testcases/basic/gang.yaml#L1
unexpected event for job 01j1gx26qvsh6nw86td9wsd8nh: expected event of type *api.EventMessage_Succeeded, but got &EventMessage_Failed{Failed:&JobFailedEvent{JobId:01j1gx26qvsh6nw86td9wsd8nh,JobSetId:gang-37YEeu3NBSYuTfptMrymr6,Queue:e2e-test-queue,Created:2024-06-29 02:27:42.468624081 +0000 UTC,ClusterId:,Reason:Job was attempted 1 times, and has been tried once on all nodes it can run on - this job will no longer be retried Final run error: etcdserver: request timed out,ExitCodes:map[string]int32{},KubernetesId:,NodeName:,PodNumber:0,ContainerStatuses:[]*ContainerStatus{},Cause:Error,PodName:,PodNamespace:,},}: context canceled