Skip to content

add nccl debug info to logs #1010

add nccl debug info to logs

add nccl debug info to logs #1010

Triggered via push November 13, 2024 19:38
Status Failure
Total duration 53m 1s
Artifacts
cleanup-job
6s
cleanup-job
Matrix: docker_image_build
Matrix: build-apptainer
Matrix: test-apptainer
Matrix: test_train-apptainer
Fit to window
Zoom out
Zoom in

Annotations

2 errors
test-apptainer (Torch_CUDA_12_3, pytorch)
Process completed with exit code 2.
test-apptainer (TF_CUDA_12_3, tensorflow)
Process completed with exit code 2.