Improve ElasticManager #203

Draft
wants to merge 4 commits into
base: master

oschulz
Contributor

@oschulz oschulz commented May 12, 2024

Adds several things to ElasticManager:

  • A callback option - this can be used to automatically run init code on new workers, add them to and remove them from worker pools, add custom logging when workers connect, etc.

  • More debug logging - often necessary to find out what's wrong if workers won't connect.

  • A mechanism to forward environment variables to workers. I haven't found a way to set them before the Julia worker process starts up, but this at least sets them before the worker does anything else.
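
To illustrate the general idea behind the environment variable forwarding, here is a minimal sketch that uses only the `Distributed` standard library. The helper name `forward_env` is hypothetical and is not the API added in this PR; the sketch only assumes that the worker is already connected, so the variables are set after worker startup rather than before it.

```julia
using Distributed

# Hypothetical helper for illustration only; not the API added in this PR.
# Copies the selected environment variables from the current process to
# worker `pid` after that worker has already started.
function forward_env(pid::Integer, names::AbstractVector{<:AbstractString})
    # Collect only the variables that are actually set in this process.
    vars = Dict{String,String}(String(n) => ENV[n] for n in names if haskey(ENV, n))
    # Run the assignment on the worker and wait until it has completed.
    remotecall_wait(pid, vars) do remote_vars
        for (name, value) in remote_vars
            ENV[name] = value
        end
    end
    return vars
end
```

With this sketch, calling `forward_env(pid, ["MY_SETTING"])` (where `MY_SETTING` is a placeholder name) copies that variable to worker `pid` if it is set locally. Names that are not set locally are skipped.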

I'm field-testing this via a local copy of ElasticManager in ParallelProcessingTools.jl (I will release a new version soon), so I can still make breaking changes if necessary. I'll sync this PR with that copy for upstreaming once it seems fully stable (it's looking pretty good so far, so hopefully soon).

oschulz added 3 commits May 7, 2024 16:18
* Add callback mechanism. Allows users to automatically initialize
  new workers, add workers to a given worker pool, etc.

* Make it easy to set worker timeout.

* Add debug logging, often necessary to figure out worker connection
  problems.
Revise has Distributed support, workers shouldn't run Revise separately.
@oschulz
Contributor Author

oschulz commented May 12, 2024

CC @JBlaschke, thanks for pointing out the potential of ElasticManager to me.

@oschulz
Contributor Author

oschulz commented Jul 13, 2024

It will take a bit longer before I upstream the ElasticManager changes from ParallelProcessingTools. I want to see whether there's a clean way to handle network device selection and whether that requires interface changes.

@oschulz
Contributor Author

oschulz commented Jan 2, 2025

@DilumAluthge, sorry, I neglected this a bit. I should really get on with making this release-ready.

2 participants