CS/ECE/ME/EP 759 Spring 2021 Final Project

This repository contains the code base for Rui Pan's final project report: Cautiously Aggressive GPU Space Sharing to Improve Resource Utilization and Job Efficiency.

Prerequisites for replicating the results include:

- An NVIDIA GPU with Volta architecture
- Python 3.8 nightly build
- CUDA-compatible PyTorch & TorchVision
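
The prerequisites above can be sanity-checked before running anything. The following is a minimal sketch and not part of this repo: `is_volta` and `check_environment` are hypothetical helper names, and the check assumes that Volta GPUs report CUDA compute capability 7.0 (e.g. V100) or 7.2 (Xavier), and that GPU 0 is the device of interest.

```python
import sys

# Assumption: Volta parts report compute capability 7.0 (e.g. V100) or 7.2 (Xavier).
VOLTA_CAPABILITIES = {(7, 0), (7, 2)}


def is_volta(capability):
    """Return True if a (major, minor) compute capability belongs to Volta."""
    return tuple(capability) in VOLTA_CAPABILITIES


def check_environment():
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info[:2] != (3, 8):
        found = f"{sys.version_info.major}.{sys.version_info.minor}"
        problems.append(f"Python 3.8 expected, found {found}")
    try:
        # Imported lazily so the Python version check still runs without PyTorch.
        import torch
        import torchvision  # noqa: F401  (only checked for importability)
    except ImportError as exc:
        problems.append(f"PyTorch/TorchVision not importable: {exc}")
        return problems
    if not torch.cuda.is_available():
        problems.append("PyTorch cannot see a CUDA device")
    elif not is_volta(torch.cuda.get_device_capability(0)):
        problems.append(f"GPU 0 is not Volta: {torch.cuda.get_device_name(0)}")
    return problems
```

Calling `check_environment()` and printing each returned string gives a quick report. Whether the results reproduce on non-Volta GPUs is not covered by this check.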

This repo contains:

- /data: Source data for running the workloads. It should be set up as follows:
  - /imagenet: ImageNet dataset for resnet50 workloads
  - /ml-20m: MovieLens 20M dataset for recommendation/recoder workloads
  - /wikitext2: WikiText-2 dataset for language modeling workloads
- /latex: LaTeX files for editing the report on Overleaf
- /output: Core-specific utilizations of workloads produced using an earlier version of the profiler
- /tables: Shell scripts for replicating the profiling results in various tables
- /workloads: Common DL/HPC workloads used in the evaluations. Many of these are copied from Gavel.
- plotting.ipynb: Jupyter Notebook that produces all figures in the report
- profiler.py: Profiler parser wrapped around nvprof
- pymps.py: Provides Python access to NVIDIA CUDA Multi-Process Service (MPS)
- README.md: Well, of course I know him. He's me.
- report.pdf: PDF version of the final report
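
Since the workloads expect the datasets under /data as listed above, the layout can be verified up front. This is a minimal sketch and not part of this repo: `missing_datasets` is a hypothetical helper, and it assumes the three dataset directories sit directly under the data root with exactly the names shown in the list.

```python
from pathlib import Path

# Subdirectory names taken from the /data entry in this README.
EXPECTED_DATASETS = ("imagenet", "ml-20m", "wikitext2")


def missing_datasets(data_root):
    """Return the expected dataset directories that are absent under data_root."""
    root = Path(data_root)
    return [name for name in EXPECTED_DATASETS if not (root / name).is_dir()]
```

For example, `missing_datasets("data")` returns an empty list when all three directories exist. The sketch only checks that the directories are present, not that their contents are complete.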
