Warning
Docker Machine has been officially deprecated since September 2021.
GitLab have forked the project and will maintains it until the end of the calendar year 2024.
Unless you are already using it, this is certainly NOT a good idea to start using it now.
Please refer to the Grid'5000 user guide for more up-to-date deployments methods for your experiments.
Docker Machine driver for the Grid'5000 testbed infrastructure.
You need a Grid'5000 account to use this driver. See this page to create an account.
You need to be connected to the Grid'5000 VPN to create and access your Docker node.
Do not forget to configure your DNS or use OpenVPN DNS auto-configuration.
Please follow the instructions from the Grid'5000 Wiki.
Binary releases for Linux, MacOS and Windows are available in the releases page.
On Linux and MacOS, you can use the following commands to install or upgrade the driver:
# download the binary for your OS and CPU architecture:
sudo curl -L -o /usr/local/bin/docker-machine-driver-g5k "<link to release>"
# grant execution rigths to the driver for everyone:
sudo chmod +x /usr/local/bin/docker-machine-driver-g5k
To use the Go tools, you need to set your GOPATH variable environment.
To get the code and compile the binary, run:
go get -u github.com/Spirals-Team/docker-machine-driver-g5k
Then, either put the driver in a directory filled in your PATH environment variable, or run:
export PATH=$PATH:$GOPATH/bin
--g5k-username
: Your Grid'5000 account username (required)--g5k-password
: Your Grid'5000 account password (required)--g5k-site
: Site where the reservation of the node will be made (required)--g5k-walltime
: Duration of the resource reservation (inHH:MM:SS
format)--g5k-image
: Name of the system image to deploy on the node--g5k-resource-properties
: Resource selection with OAR properties--g5k-make-resource-reservation
: Make a resource reservation for the given start date--g5k-use-resource-reservation
: Use a resource reservation (need to be an existing job ID)--g5k-select-node-from-reservation
: Hostname of the node to use from the reservation--g5k-reuse-ref-environment
: Reuse the Grid'5000 reference environment instead of re-deploying the node--g5k-job-queue
: Specify the job queue (thebesteffort
queue is NOT supported)--g5k-external-ssh-public-keys
: SSH public key(s) allowed to connect to the node (in authorized_keys format)--g5k-keep-resource-at-deletion
: Keep the allocated resource when removing the machine--g5k-job-types
: Specify the OAR job type(s)
Flag name | Environment variable | Default value |
---|---|---|
--g5k-username |
G5K_USERNAME |
|
--g5k-password |
G5K_PASSWORD |
|
--g5k-site |
G5K_SITE |
|
--g5k-walltime |
G5K_WALLTIME |
"1:00:00" |
--g5k-image |
G5K_IMAGE |
"debian11-std" |
--g5k-resource-properties |
G5K_RESOURCE_PROPERTIES |
|
--g5k-make-resource-reservation |
G5K_MAKE_RESOURCE_RESERVATION |
|
--g5k-use-resource-reservation |
G5K_USE_RESOURCE_RESERVATION |
|
--g5k-select-node-from-reservation |
G5K_SELECT_NODE_FROM_RESERVATION |
|
--g5k-reuse-ref-environment |
G5K_REUSE_REF_ENVIRONMENT |
False |
--g5k-job-queue |
G5K_JOB_QUEUE |
"default" |
--g5k-external-ssh-public-keys |
G5K_EXTERNAL_SSH_PUBLIC_KEYS |
|
--g5k-keep-resource-at-deletion |
G5K_KEEP_RESOURCE_AT_DELETION |
False |
--g5k-job-types |
G5K_JOB_TYPES |
You can use OAR properties to only select a node that matches your hardware requirements.
If you give incorrect properties or no resource matches your request, you will get this error:
...
Error with pre-create check: "Error when submitting new job: The server returned an error (code: 400) after sending Job submission: '400 Bad Request'"
More information about usage of OAR properties are available on the Grid'5000 Wiki.
You can either do a job submission to reserve resources as soon as possible (this is the default mode) or do an advance reservation for a specific date/time.
To do a resource reservation, you need to use the --g5k-make-resource-reservation
flag and provide a starting date/time in either the YYYY-MM-DD HH:MM:SS
date format or an UNIX timestamp.
Don't forget to save the job ID of your reservation in order to be able to create a machine when the resources will be available.
To use a resource reservation, set the --g5k-use-resource-reservation
flag with the job ID of an existing reservation.
In case the reservation have multiple nodes, you need to select one using the --g5k-select-node-from-reservation
flag, otherwise the first node will be taken.
This will create a machine, deploy an OS image and provision Docker on the node. Please note that the job must be in running
state in order for the machine to be created, otherwise the driver will wait until the job start.
By default the resource is automatically deallocated when you remove a machine using the rm
command.
However, you can use the g5k-keep-resource-at-deletion
flag when creating the machine to keep the resource allocated even when the machine is removed.
This can be used as safeguard to protect from deallocating the resource when you use an advance reservation that have been approved by the Grid'5000 executive committee, or allow to redeploy the node OS image by removing and recreating the machine using the same resource reservation.
More information about the resources reservation are available on the Grid'5000 Wiki.
You can gain time by reusing the Grid'5000 reference environment instead of redeploying the machine.
Doing so will skip the node deployment phase and will save a lot of time at the machine creation.
If you don't need a tweaked environment or rely on Grid'5000 services (NFS for example), you should use this option.
Please note that, in this mode, if you reboot the machine by any mean, the reserved resource will be released and the node will be redeployed with the reference environment.
You can specify the job queue of your reservation and access the resources of the production queue.
The driver only support default
, production
and testing
queues. The besteffort
queue is NOT supported.
If you use an incorrect queue for your site you will get the following error:
...
Error with pre-create check: "Error when submitting new job: The server returned an error (code: 500) after sending Job submission: '500 Internal Server Error'"
See this page for more information about the production queue.
An example reusing the Grid'5000 standard environment:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-reuse-ref-environment \
test-node
An example deploying the Ubuntu 20.04 (focal) (ubuntu2004-x64-min
) environment image on the node:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-image "ubuntu2004-x64-min" \
test-node
An example using environment variables to configure the driver:
export G5K_USERNAME="user"
export G5K_PASSWORD="********"
export G5K_SITE="lille"
docker-machine create -d g5k test-node
An example using resource properties (node in chimint
cluster having more than 8GB of RAM and at least 4 CPU cores):
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-resource-properties "cluster = 'chimint' and memnode > 8192 and cpucore >= 4" \
test-node
An example doing a resource reservation of 1 node for 8 hours
starting the 2019-01-01
at 20:00:00
:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-walltime "08:00:00" \
--g5k-make-resource-reservation "2019-01-01 20:00:00" \
test-node
An example using the first node of a resource reservation having the 1234567
job ID:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-use-resource-reservation "1234567" \
test-node
An example using the chifflet-2.lille.grid5000.fr
node of a resource reservation having the 1234567
job ID:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-use-resource-reservation "1234567" \
--g5k-select-node-from-reservation "chifflet-2.lille.grid5000.fr" \
test-node
An example adding two external SSH keys (the keys can be of any supported type and size):
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lille" \
--g5k-external-ssh-public-keys "ssh-ed25519 AAAAC3NzaC1lZDI1NTE5AAAAIFLs3JzUYn7LbHE+SzJNoMvYbasnhjlen0k6dFs801DT test-ed25519" \
--g5k-external-ssh-public-keys "ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAAAgQC5qQt/nzGW19uCb9CDVEvP93LZ2mu3rd7drPP1nLf1pzLwlL2U2ksfwDCjMWU0P7KA6tB4scI+4dhxj07t0Z8g4TsMGYhbG0kjf7tWN73DombB4h/zobo2GvVoMg0NBLTP4peXLYAEofTYc0g7OWtJicAzLwcMzHsitDjjBwCKHQ== test-rsa" \
test-node
An example using the exotic
job type to reserve a node of the pyxis
cluster (exotic resources) from the lyon
site:
docker-machine create -d g5k \
--g5k-username "user" \
--g5k-password "********" \
--g5k-site "lyon" \
--g5k-job-types "exotic" \
--g5k-resource-properties "cluster = 'pyxis'" \
test-node