Skip to content

Terraform and repetitions

Compare
Choose a tag to compare
@JMGaljaard JMGaljaard released this 06 Sep 14:27
· 58 commits to main since this release

In short

This release revamps the deployment on GKE with Terraform, making deployment a breeze. Furthermore, the dependency list is slimmed down from Kubeflow to only Kubeflow-training-operators. This alliviates the overhead on your cluster, as for example istio is now no-longer required for deployment.

For experiments, the orchestrator allows for running repetitions of experiments directly. This allows to describe an experiment file once (e.g. a distributed learning configuration), and run it multiple times in a single deployment.

What's Changed

Full Changelog: v0.2.2...v0.3.0