NCCL debug with benchmark_cuda #317

Open
insanum opened this issue Feb 9, 2022 · 1 comment

Comments

insanum commented Feb 9, 2022

I'm running benchmark_cuda with MPI and setting various NCCL environment variables on the command line. When I specify -x NCCL_DEBUG=INFO, I don't see any debug output on the console. Any ideas?
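
One way to narrow this down is to check whether the variable reaches the launched processes at all. The following is a minimal sketch, assuming the launcher is Open MPI (whose mpirun accepts -x and sets OMPI_COMM_WORLD_RANK for each process); the function name describe_nccl_env is hypothetical and not part of gloo or NCCL.

```python
import os
import socket


def describe_nccl_env(environ=None):
    """Return a one-line report of the NCCL_* variables visible to this process."""
    env = os.environ if environ is None else environ
    # Assumption: Open MPI is the launcher, which exports the rank in this variable.
    rank = env.get("OMPI_COMM_WORLD_RANK", "?")
    nccl_vars = {k: v for k, v in sorted(env.items()) if k.startswith("NCCL_")}
    settings = ", ".join(f"{k}={v}" for k, v in nccl_vars.items())
    if not settings:
        settings = "no NCCL_* variables set"
    return f"[{socket.gethostname()} rank {rank}] {settings}"


if __name__ == "__main__":
    print(describe_nccl_env())
```

Launching this script with the same mpirun options used for benchmark_cuda should print one line per rank. If every rank reports NCCL_DEBUG=INFO, the variable is being propagated and the missing output has another cause.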

insanum commented Feb 9, 2022

Possibly answering my own question: I think I know what is going on. Gloo isn't using NCCL to create the transport paths between nodes; it just uses the broadcast/allreduce kernel code from NCCL to run on the GPUs.
