Note to anyone who happens to stumble upon this repo

This allocator actually does not allow ROCm to make use of the GTT, but rather, actually allocates host memory. From my testing on radeon 680m integrated graphics, hipHostMalloc() seems to perform the same as allocating directly on the device, but starts drastically slowing down when the total memory allocation is higher than what the integrated graphics have dedicated. Unfortunately, this most likely isn't the fix you wished for, but it seemed to at least work from my testing.

PyTorch Host Allocator for APUs

ROCm does not take GTT into account when calculating usable VRAM on APU platforms.

With this allocator, now we can use GTT (host memory) with PyTorch, and there is no need to tweak VRAM configuration.

Usage

Compile gttalloc.c with hipcc gttalloc.cc -o alloc.so -shared -fPIC.

If hipcc is not found, it may reside in /opt/rocm/bin/hipcc.

Then, for programs using PyTorch, put code below between import torch and actual usage of Torch.

new_alloc = torch.cuda.memory.CUDAPluggableAllocator('<path to alloc.so>','gtt_alloc','gtt_free');
torch.cuda.memory.change_current_allocator(new_alloc)

It works!

If you have something work or not work with HIP on APUs, please share it in discussion.

Summary

To be filled...

中文指南

参见 https://typeof.pw/archives/pytorch-on-apu-vram .

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
README.md		README.md
gttalloc.c		gttalloc.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Note to anyone who happens to stumble upon this repo

PyTorch Host Allocator for APUs

Usage

It works!

Summary

中文指南

About

Releases

Packages

Languages

GatienDoesStuff/torch-apu-helper

Folders and files

Latest commit

History

Repository files navigation

Note to anyone who happens to stumble upon this repo

PyTorch Host Allocator for APUs

Usage

It works!

Summary

中文指南

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages