Torch Batcher

Serve batched requests using Redis. Throughput can scale linearly by increasing the number of workers per device and the number of devices.

Dependencies

Install Redis, then install the Python dependencies:
pip3 install -r requirements.txt

Usage

For linear scaling, start nvidia-cuda-mps-control first. See Section 2.1.1, "GPU utilization", of the NVIDIA Multi-Process Service (MPS) documentation for details.

nvidia-cuda-mps-control -d # To start

# To exit MPS after stopping the server:
nvidia-cuda-mps-control # Opens the MPS control prompt
quit # Type this at the prompt to shut MPS down

Start Redis
```
redis-server --save "" --appendonly no
```

Start Batch-Serving

supervisord -c supervisord.conf # Start 3 workers on a single GPU
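
The worker code itself is not reproduced in this README. As a rough illustration of what batching from a queue can look like, here is a minimal sketch, not the repository's implementation. The function `collect_batch`, its parameters, and the non-blocking `pop` callable are all hypothetical names introduced for this example.

```python
import time
from typing import Callable, List, Optional


def collect_batch(
    pop: Callable[[], Optional[bytes]],
    max_batch_size: int = 8,
    max_wait_s: float = 0.01,
) -> List[bytes]:
    """Gather up to max_batch_size items, waiting at most max_wait_s for more.

    `pop` is a hypothetical non-blocking callable that returns the next
    queued request, or None when the queue is currently empty.
    """
    batch: List[bytes] = []
    deadline = time.monotonic() + max_wait_s
    while len(batch) < max_batch_size:
        item = pop()
        if item is not None:
            batch.append(item)
        elif time.monotonic() >= deadline:
            break  # queue is empty and the wait budget is spent
        else:
            time.sleep(0.001)  # short pause so the loop does not spin
    return batch
```

With redis-py, `pop` could be something like `lambda: r.lpop("requests")`, where `r` is a `redis.Redis` client and the key name `requests` is an assumption. The two limits trade latency against batch size: a larger `max_wait_s` tends to fill batches more completely but delays the first request in each batch.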

Start Batch benchmark
```
python3 bench_batched.py
```
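
The contents of bench_batched.py are not shown here. For readers who want to measure throughput against their own client, the following is a minimal sketch of a concurrent benchmark loop, under the assumption that a single request can be issued by a callable. `measure_throughput` and its `send` argument are hypothetical and are not taken from the repository.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Sequence


def measure_throughput(
    send: Callable[[bytes], object],
    payloads: Sequence[bytes],
    concurrency: int = 16,
) -> float:
    """Send every payload using a thread pool and return requests per second.

    `send` is a hypothetical callable that issues one request and blocks
    until its response arrives.
    """
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        # list() forces all requests to finish before the timer stops
        list(pool.map(send, payloads))
    elapsed = time.perf_counter() - start
    # Guard against a zero interval on very coarse clocks
    return len(payloads) / max(elapsed, 1e-9)
```

Running this at several `concurrency` values, and with different worker counts, is one way to check whether throughput actually scales as expected on a given machine.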
