From c3110db4d5451d17355c0e010ec623e8db68db40 Mon Sep 17 00:00:00 2001 From: Steve Leung Date: Mon, 25 Jan 2021 14:26:06 -0700 Subject: [PATCH] CHANGELOG: update for 4.1 release --- CHANGELOG.md | 16 ++++------------ 1 file changed, 4 insertions(+), 12 deletions(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index 814f24f9..6d00eb4c 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -2,11 +2,12 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://rocfft.readthedocs.io/en/latest/). -## [(Unreleased) rocFFT 1.0.10 for ROCm 4.1.0] +## [rocFFT 1.0.10 for ROCm 4.1.0] ### Added - Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all - manual kernels. + kernels. +- Switch to new syntax for specifying AMD GPU architecture names and features. ### Optimizations - Optimized C2C/R2C 3D 64, 81, 100, 128, 200, 256 cube sizes. @@ -18,15 +19,6 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro ### Changed - rocFFT now automatically allocates a work buffer if the plan requires one but none is provided. - -## [(Unreleased) rocFFT 1.0.9 for ROCm 4.0.0] - -### Added -- Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all - generated kernels. -- Switch to new syntax for specifying AMD GPU architecture names and features. - -### Changed - An explicit `rocfft_status_invalid_work_buffer` error is now returned when a work buffer of insufficient size is provided. - Updated online documentation. @@ -90,4 +82,4 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro - Fixed correctness of certain large 1D, and 2D power-of-3, 5 sizes. - Fixed incorrect buffer assignment for some even-length R2C transforms. - Fixed `` inclusion on C compilers. -- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels. \ No newline at end of file +- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.