rocSPARSE 3.3.0 for ROCm 6.3.0
amd-aakash
released this
04 Dec 22:38
·
2 commits
to release/rocm-rel-6.3
since this release
Added
- Add
rocsparse_create_extract_descr
,rocsparse_destroy_extract_descr
,rocsparse_extract_buffer_size
,rocsparse_extract_nnz
, androcsparse_extract
APIs to allow extraction of the upper or lower part of sparse CSR or CSC matrices. - Support for the gfx1151, gfx1200, and gfx1201 architectures.
Changed
- Change the default compiler from hipcc to amdclang in install script and cmake files.
- Change address sanitizer build targets so that only gfx908:xnack+, gfx90a:xnack+, gfx940:xnack+, gfx941:xnack+, and gfx942:xnack+ are built when
BUILD_ADDRESS_SANITIZER=ON
is configured.
Optimized
- Improved user documentation
Resolved issues
- Fixed the
csrmm
merge path algorithm so that diagonal is clamped to the correct range. - Fixed a race condition in
bsrgemm
that could on rare occasions cause incorrect results. - Fixed an issue in
hyb2csr
where the CSR row pointer array was not being properly filled whenn=0
,coo_nnz=0
, orell_nnz=0
. - Fixed scaling in
rocsparse_Xhybmv
when only performingy=beta*y
, for example, wherealpha==0
iny=alpha*Ax+beta*y
. - Fixed
rocsparse_Xgemmi
failures when the y grid dimension is too large. This occured when n >= 65536. - Fixed the gfortran dependency for the
azurelinux
operating system.