add nccl debug info to logs #1010
docker_action.yml
on: push
cleanup-job
6s
Matrix: docker_image_build
Matrix: build-apptainer
Matrix: test-apptainer
Matrix: test_train-apptainer
Annotations
2 errors
test-apptainer (Torch_CUDA_12_3, pytorch)
Process completed with exit code 2.
|
test-apptainer (TF_CUDA_12_3, tensorflow)
Process completed with exit code 2.
|