Release 2.2.x testing #2118

Merged 1 commit on Jan 17, 2024
8 changes: 8 additions & 0 deletions userbenchmark/release-test/configs/2.2.2.yaml
@@ -0,0 +1,8 @@
+cuda:
+  - version: 12.1
+    magma_version: magma-cuda121
+pytorch:
+  - version: 2.2.0
+    conda_channel: pytorch-test
+  - version: 2.1.2
+    conda_channel: pytorch
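
Note: a minimal sketch of how a config like this might be expanded into test jobs. It is not the harness's actual loader. It assumes the file has already been parsed into a plain dict (inlined here to avoid a YAML dependency) and that every `cuda` entry is paired with every `pytorch` entry; `expand_matrix` and the output key names are hypothetical. An unquoted `12.1` loads from YAML as a float while `2.2.0` loads as a string, so the sketch normalizes both with `str()`.

```python
from itertools import product

# Assumption: this dict mirrors what a YAML loader would return for the config.
config = {
    "cuda": [{"version": 12.1, "magma_version": "magma-cuda121"}],
    "pytorch": [
        {"version": "2.2.0", "conda_channel": "pytorch-test"},
        {"version": "2.1.2", "conda_channel": "pytorch"},
    ],
}


def expand_matrix(cfg):
    """Pair every CUDA entry with every PyTorch entry (hypothetical helper)."""
    return [
        {
            "cuda_version": str(cuda["version"]),
            "magma_version": cuda["magma_version"],
            "pytorch_version": str(torch_cfg["version"]),
            "pytorch_channel": torch_cfg["conda_channel"],
        }
        for cuda, torch_cfg in product(cfg["cuda"], cfg["pytorch"])
    ]


for job in expand_matrix(config):
    print(job)
```

With the values above this yields two jobs, one per PyTorch entry, both on CUDA 12.1.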
2 changes: 1 addition & 1 deletion userbenchmark/release-test/result_analyzer.py
@@ -104,4 +104,4 @@ def analyze(work_dir: Path):
     dump_result_csv(work_dir, results)
     # dump results to userbenchmark object
     results = dump_userbenchmark_result(results)
-    return results
+    return results
5 changes: 2 additions & 3 deletions userbenchmark/release-test/run_release_test.sh
@@ -16,9 +16,6 @@ fi

 . switch-cuda.sh "${CUDA_VERSION}"

-if [[ ${CUDA_VERSION} == "12.1" ]]; then
-    pip install nvidia-cuda-nvcc-cu12
-fi

 nvcc --version
 sudo apt-get install bc
@@ -28,12 +25,14 @@ pushd "${EXAMPLES_DIR}/mnist"
 export LOG_FILE=${RESULT_DIR}/mnist/result.log
 export MEM_FILE=${RESULT_DIR}/mnist/result_mem.log
 ${PREFIX} bash "${CURRENT_DIR}/monitor_proc.sh" python main.py --epochs 10
+
 # run mnist-hogwild
 mkdir -p ${RESULT_DIR}/mnist_hogwild
 pushd "${EXAMPLES_DIR}/mnist_hogwild"
 export LOG_FILE=${RESULT_DIR}/mnist_hogwild/result.log
 export MEM_FILE=${RESULT_DIR}/mnist_hogwild/result_mem.log
 ${PREFIX} bash "${CURRENT_DIR}/monitor_proc.sh" python main.py --epochs 10
+
 # run CPU WLM LSTM
 mkdir -p ${RESULT_DIR}/wlm_cpu_lstm
 pushd "${EXAMPLES_DIR}/word_language_model"
12 changes: 8 additions & 4 deletions userbenchmark/release-test/setup_env.sh
@@ -29,18 +29,22 @@ conda update --all -y

 sudo ln -sf /usr/local/cuda-${CUDA_VERSION} /usr/local/cuda
 conda uninstall -y pytorch torchvision pytorch-cuda
-conda uninstall -y pytorch torchvision
+conda uninstall -y pytorch torchvision torchaudio
 # make sure we have a clean environment without pytorch
-pip uninstall -y torch torchvision
+pip3 uninstall -y torch torchvision torchaudio torch-tensorrt

 # install magma
 conda install -y -c pytorch ${MAGMA_VERSION}

 # install pip version of pytorch and torchvision
 if [[ ${PYTORCH_CHANNEL} == "pytorch-test" ]]; then
-    pip3 install torch torchvision --index-url https://download.pytorch.org/whl/test/cu${CUDA_VERSION//./}
+    pip3 install --pre torch==2.2.0 --extra-index-url https://download.pytorch.org/whl/test/cu121
+    pip3 install --pre torchvision==0.17.0 torchaudio==2.2.0 --extra-index-url https://download.pytorch.org/whl/test/cu121
+    #conda install -y pytorch=2.1.2 torchvision=0.16.2 torchaudio=2.1.2 pytorch-cuda=12.1 -c pytorch-test -c nvidia
+    #pip3 install --force-reinstall --no-cache-dir torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/test/cu${CUDA_VERSION//./}
 else
-    pip3 install torch torchvision
+    pip3 install --force-reinstall --no-cache-dir torch torchvision torchaudio
+    #conda install -y pytorch=2.1.1 torchvision=0.16.1 torchaudio=2.1.1 pytorch-cuda=12.1 -c pytorch -c nvidia
 fi

 python -c 'import torch; print(torch.__version__); print(torch.version.git_version)'
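
Note: the script contains both install lines that derive the wheel index suffix with `cu${CUDA_VERSION//./}` and lines that hard-code `cu121`. The following is a minimal sketch of the derived form, written in Python for illustration only. `wheel_index_url` is a hypothetical helper, not part of the benchmark, and it assumes that only the `pytorch-test` channel uses the `/whl/test/` index while any other channel falls back to pip's default index.

```python
from typing import Optional


def wheel_index_url(cuda_version: str, channel: str) -> Optional[str]:
    """Build the test-wheel index URL for a CUDA version (hypothetical helper).

    Mirrors the shell expansion cu${CUDA_VERSION//./}, which removes every
    dot from the version string, e.g. "12.1" becomes "cu121".
    """
    cuda_tag = "cu" + cuda_version.replace(".", "")
    # Assumption: only the "pytorch-test" channel maps to the test index;
    # other channels use pip's default index, signalled here by None.
    if channel == "pytorch-test":
        return f"https://download.pytorch.org/whl/test/{cuda_tag}"
    return None


print(wheel_index_url("12.1", "pytorch-test"))
print(wheel_index_url("12.1", "pytorch"))
```

Deriving the suffix keeps the install step consistent with whatever `CUDA_VERSION` the config supplies, whereas the hard-coded `cu121` form only matches CUDA 12.1.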