GitHub - NaorYaacov/intel-optimization-for-horovod

Intel® Optimization for Horovod* is a distributed training framework for TensorFlow* and PyTorch*. Its goal is to make distributed deep learning workloads faster and easier to run on Intel GPU devices. It is based on the latest release version (v0.26.1) of public Horovod.

Install

Hardware Requirements

Intel® Data Center GPU Max Series, Driver Version: 602

Software Requirement

Note: For now, the patched PyTorch 1.13.0a0 is required to work with Intel® Extension for PyTorch* on Intel® graphics cards.

| Software | Installation requirement |
| --- | --- |
| Intel® oneAPI Base Toolkit | Install Intel® oneAPI Base Toolkit |
| TensorFlow | Install TensorFlow 2.12.0 |
| Intel® Extension for TensorFlow* | Install Intel® Extension for TensorFlow* |
| PyTorch | Install PyTorch 1.13.0a0 |
| Intel® Extension for PyTorch* | Install Intel® Extension for PyTorch* |
| System | Ubuntu 22.04, RedHat 8.6 (64-bit), SUSE Linux Enterprise Server (SLES) 15 SP3/SP4 |
| Python | 3.8-3.10 |
| Pip | 19.0 or later (requires manylinux2014 support) |
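
The Python and pip version constraints above can be checked before installing. The following is a minimal sketch, not part of this project: the helper names `python_supported` and `pip_supported` are hypothetical, and the pip check only compares the first two components of the version string.

```python
import sys

# Inclusive (major, minor) range taken from the requirements above: Python 3.8-3.10.
SUPPORTED_PYTHON = ((3, 8), (3, 10))
# Minimum pip version taken from the requirements above: 19.0 or later.
MIN_PIP = (19, 0)


def python_supported(version_info=sys.version_info):
    """Return True if the interpreter's major.minor version is within 3.8-3.10."""
    low, high = SUPPORTED_PYTHON
    return low <= tuple(version_info[:2]) <= high


def pip_supported(version):
    """Return True if a pip version string such as '23.1.2' is 19.0 or later."""
    parts = version.split(".")
    major = int(parts[0])
    # Treat a missing or non-numeric minor component as 0.
    minor = int(parts[1]) if len(parts) > 1 and parts[1].isdigit() else 0
    return (major, minor) >= MIN_PIP
```

Calling `python_supported()` with no argument checks the running interpreter. The pip version string can be taken from the output of `pip --version`.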

Install GPU Drivers

| OS | Intel GPU | Install Intel GPU Driver |
| --- | --- | --- |
| Ubuntu 22.04, RedHat 8.6, SLES 15 SP3/SP4 | Intel® Data Center GPU Max Series | Refer to the Installation Guides for the latest driver installation. To install the verified driver version 602 for Intel® Data Center GPU Max Series/Intel® Data Center GPU Flex Series, append the specific version after the component names. |

Installation Channel:

Intel® Optimization for Horovod* can be installed through the following channels:

| PyPI | Source |
| --- | --- |
| Install from pip | Build from source |

Install for GPU

Intel® Optimization for Horovod* can be installed with either supported framework. Choose Intel® Extension for TensorFlow* or Intel® Extension for PyTorch* as the dependency.

To install Intel® Extension for TensorFlow* and Intel® Optimization for Horovod*, run:

```
pip install tensorflow==2.12.0
pip install --upgrade intel-extension-for-tensorflow[gpu]
pip install intel-optimization-for-horovod
```

To install Intel® Extension for PyTorch* and Intel® Optimization for Horovod*, run:

```
python -m pip install torch==1.13.0a0 -f https://developer.intel.com/ipex-whl-stable-xpu
python -m pip install intel_extension_for_pytorch==1.13.120+xpu -f https://developer.intel.com/ipex-whl-stable-xpu
pip install intel-optimization-for-horovod
```

Running Intel® Optimization for Horovod*

The example commands below show how to run distributed training.

To run on a machine with 2 Intel GPUs, which have 4 tiles in total:
```
horovodrun -np 4 python train.py
```

To run on 4 machines with 2 GPUs (4 tiles) each:

```
horovodrun -np 16 -H server1:4,server2:4,server3:4,server4:4 python train.py
```
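
In both commands above, the `-np` value equals the total number of GPU tiles, and each `-H` entry gives a host name followed by its tile count. The sketch below shows that arithmetic, assuming one process per tile as the examples suggest. The function `horovodrun_command` and its parameters are hypothetical helpers for illustration, not part of Horovod.

```python
def horovodrun_command(script, hosts=None, gpus_per_host=2, tiles_per_gpu=2):
    """Build a horovodrun argument list, assuming one process per GPU tile."""
    # Processes (slots) per machine: GPUs on the machine times tiles per GPU.
    slots = gpus_per_host * tiles_per_gpu
    if not hosts:
        # Single machine: no -H list, only the local tile count.
        return ["horovodrun", "-np", str(slots), "python", script]
    # Multiple machines: each host contributes the same number of slots.
    host_spec = ",".join(f"{host}:{slots}" for host in hosts)
    total = slots * len(hosts)
    return ["horovodrun", "-np", str(total), "-H", host_spec, "python", script]


# Reproduce the two commands shown above.
print(" ".join(horovodrun_command("train.py")))
print(" ".join(horovodrun_command(
    "train.py", hosts=["server1", "server2", "server3", "server4"])))
```

The returned list can be passed to `subprocess.run` unchanged. Machines with differing GPU counts would need a per-host slot value instead of the single `slots` used here.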

Running Intel® Optimization for Horovod* with TensorFlow on Intel GPU

Training models with Intel® Extension for TensorFlow* is straightforward. Refer to the TensorFlow examples for more details.
