CHANGELOG: update for 4.1 release
evetsso authored Jan 25, 2021
1 parent a07759d commit c3110db
Showing 1 changed file with 4 additions and 12 deletions.
16 changes: 4 additions & 12 deletions CHANGELOG.md
@@ -2,11 +2,12 @@

 Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://rocfft.readthedocs.io/en/latest/).

-## [(Unreleased) rocFFT 1.0.10 for ROCm 4.1.0]
+## [rocFFT 1.0.10 for ROCm 4.1.0]

 ### Added
 - Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all
-manual kernels.
+kernels.
+- Switch to new syntax for specifying AMD GPU architecture names and features.

 ### Optimizations
 - Optimized C2C/R2C 3D 64, 81, 100, 128, 200, 256 cube sizes.
@@ -18,15 +19,6 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
 ### Changed
 - rocFFT now automatically allocates a work buffer if the plan
 requires one but none is provided.
-
-## [(Unreleased) rocFFT 1.0.9 for ROCm 4.0.0]
-
-### Added
-- Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all
-generated kernels.
-- Switch to new syntax for specifying AMD GPU architecture names and features.
-
-### Changed
 - An explicit `rocfft_status_invalid_work_buffer` error is now
 returned when a work buffer of insufficient size is provided.
 - Updated online documentation.
@@ -90,4 +82,4 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
 - Fixed correctness of certain large 1D, and 2D power-of-3, 5 sizes.
 - Fixed incorrect buffer assignment for some even-length R2C transforms.
 - Fixed `<cstddef>` inclusion on C compilers.
-- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
+- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
