-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ACCESS-OM3
Deployment on Gadi Killed during esmf
build
Phase
#8
Comments
@CodeGat Where is the compilation happening? On a login or a compute node?
By default spack does parallelize builds. What it does not by default is to parallelize over builds, but that's a bit more tricky and not relevant here. |
The compilation is happening on a login node |
There are several limits enforced on the login nodes regarding memory usage and CPU time. I strongly recommend you move the builds to a compute node. That will require submitting a job though... |
And attendant wait in the queue for every build job. Would really like to avoid that if possible. |
Have confirmed that, for install of
Will need to test how long it takes to install something like |
Just to finish up on this conversation thread, it's now a per-model |
Pull request in question: #5
Within the above pull request, during the deployment of
ACCESS-OM3
, our jobs seemingly get killed by Gadi duringesmf
sbuild
phase. Examples of these runs are at the bottom of the PR:It may be something to do with an Out Of Memory Error being triggered by Gadi during that phase.
Potential Solutions
esmf
build phase?Pinging @micaeljtoliveira - have you had any experience with an issue like this when building
ACCESS-OM3
on Gadi?The text was updated successfully, but these errors were encountered: