GitHub - devngho/tpuswarm: Create spot TPU instances, then run a batched job on them.

tpuswarm

Create spot TPU instances, then run a batched job on them.

This project was supported with Cloud TPUs from Google's TPU Research Cloud (TRC).⚡

Usage

Examples are in the example directory.

. login_gcp # for ssh-agent. You should run it first!

python tpuswarm.py --region=us-central2-b --project=your-project --tpu-device=v4-8 --node-count=4 --batch=512 --command="echo \"Hello, TPUs\!\" > /tmp/hello.txt" --port=5000 --host=0.0.0.0

python tpuswarm_clean.py --region=us-central2-b --project=your-project

Guide

Your program should host a HTTPS API at 8080, and accept POST requests with a JSON body of the following shape:

POST /batch
{
  "prompts": [ // the list of prompts to process
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog."
  ],
  "samplings": { // the sampling parameters or any configuration you need
    "temperature": 0.5,
    "top_k": 50,
    "top_p": 0.95,
    "repetition_penalty": 1.0,
    "length": 128
  }
}

GET /heartbeat
200 OK

And return a JSON response of the following shape:

{
  "result": [
    // any shape you want
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog.",
    "The quick brown fox jumps over the lazy dog."
  ]
}

You can send a same shape of request to the /batch endpoint at the tpuswarm endpoint, and it will distribute the requests to the TPUs and return the results. tpuswarm will split the requests into batch size, and send them to the TPUs in parallel.

Your API server should be HTTPS to ensure the security of the data. You can use a self-signed certificate for this purpose(it skips the certificate verification).

License

MIT License 💕

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
example		example
.gitignore		.gitignore
LICENSE		LICENSE
_tpuswarm.py		_tpuswarm.py
login_gcp		login_gcp
readme.md		readme.md
tpuswarm.py		tpuswarm.py
tpuswarm_clean.py		tpuswarm_clean.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tpuswarm

Usage

Guide

License

About

Releases

Packages

Languages

License

devngho/tpuswarm

Folders and files

Latest commit

History

Repository files navigation

tpuswarm

Usage

Guide

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages