boringmorning/a100-cluster-simulator

a100-cluster-simulator

Introduction

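My master's thesis proposes job scheduling and resource allocation methods for MIG GPUs, and I designed this a100 cluster simulator for that research. The project consists of a workload generator, a cluster simulator, and a result analyzer. The main branch contains a simplified, symmetric a100 model; the real branch contains the actual, asymmetric a100 model.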

Compilation

make

See the Makefile for details.

Execution

  • To run the complete experiment, see defaultRun.sh.
  • The individual steps are as follows:
  1. Generate workload
python genJob.py $i $j
  • i: number of workload configs
  • j: number of testcases for each config
  2. Simulated experiment
./main.exe $NGPU $i $j $k
  • NGPU: number of GPUs in the cluster
  • i: workload config ID
  • j: testcase ID
  • k: algorithm ID
  3. Result analysis
python result.py $i $j
  • i: number of workload configs
  • j: number of testcases for each config
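
The three steps above can be chained into one sweep. The following is a minimal dry-run sketch, not the contents of defaultRun.sh: it only builds and prints the command lines. The helper name build_commands, the algorithm ID list, the NGPU value of 4, and the assumption that config and testcase IDs are contiguous and start at 0 are all hypothetical; check genJob.py and main.exe for the actual ID ranges.

```python
import shlex


def build_commands(ngpu, n_configs, n_testcases, algorithm_ids, first_id=0):
    """Return the argv lists for one full sweep, in execution order.

    Assumption (not confirmed by the README): config and testcase IDs
    are contiguous integers starting at `first_id`.
    """
    # Step 1: generate every workload once.
    cmds = [["python", "genJob.py", str(n_configs), str(n_testcases)]]

    # Step 2: one simulator run per (config, testcase, algorithm) triple.
    for i in range(first_id, first_id + n_configs):
        for j in range(first_id, first_id + n_testcases):
            for k in algorithm_ids:
                cmds.append(["./main.exe", str(ngpu), str(i), str(j), str(k)])

    # Step 3: analyze all results once the simulations are done.
    cmds.append(["python", "result.py", str(n_configs), str(n_testcases)])
    return cmds


if __name__ == "__main__":
    # Illustrative values only: 4 GPUs, 2 configs, 3 testcases, algorithms 0 and 1.
    for argv in build_commands(ngpu=4, n_configs=2, n_testcases=3,
                               algorithm_ids=[0, 1]):
        print(shlex.join(argv))
```

To execute the sweep instead of printing it, each argv list could be passed to subprocess.run(argv, check=True) from the repository root after compilation.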
