Skip to content

Commit

Permalink
update changelog
Browse files Browse the repository at this point in the history
  • Loading branch information
feizheng10 authored Jan 6, 2022
1 parent 09a68ed commit 29da58b
Showing 1 changed file with 12 additions and 12 deletions.
24 changes: 12 additions & 12 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,23 +21,23 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
### Removed
- The hipFFT API (header) has been removed from after a long deprecation period. Please use the [hipFFT](https://github.com/ROCmSoftwarePlatform/hipFFT) package/repository to obtain the hipFFT API.

## (Unreleased) rocFFT 1.0.15
## rocFFT 1.0.15

### Changed
- Re-aligned split device library into 4 roughly equal libraries.
- Implemented the FuseShim framework to replace the original OptimizePlan
- Implemented the generic buffer-assignment framework. The buffer assignment
is no longer performed by each node. We designed a generic algorithm to
test and pick the best assignment path.
- With the help of FuseShim, we can achieve more kernel-fusions as possible.
Do not read the imaginary part of the DC and Nyquist modes for even-length
With the help of FuseShim, we can achieve more kernel-fusions as possible.
- Do not read the imaginary part of the DC and Nyquist modes for even-length
complex-to-real transforms.

### Optimizations
- Optimized twiddle-conjugation; complex-to-complex inverse transforms should have similar performance to foward transforms now.
- Improved performance of single-kernel small 2D transforms.

## (Unreleased) rocFFT 1.0.14
## rocFFT 1.0.14

### Optimizations
- Optimized SBCC kernels of length 52, 60, 72, 80, 84, 96, 104, 108, 112, 160,
Expand All @@ -57,7 +57,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
### Changed
- Split 2D device code into separate libraries.

## [rocFFT 1.0.13 for ROCm 4.4.0]
## rocFFT 1.0.13 for ROCm 4.4.0

### Optimizations
- Improved many plans by removing unnecessary transpose steps.
Expand All @@ -77,7 +77,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
### Fixed
- Improved large 1D transform decompositions.

## [rocFFT 1.0.12 for ROCm 4.3.0]
## rocFFT 1.0.12 for ROCm 4.3.0

### Changed
Re-split device code into single-precision, double-precision, and miscellaneous kernels.
Expand Down Expand Up @@ -107,7 +107,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
- Fixed potential crashes in 3D transforms with unusual strides, for
SBCC-optimized sizes.

## [rocFFT 1.0.11 for ROCm 4.2.0]
## rocFFT 1.0.11 for ROCm 4.2.0

### Changed
Move device code into main library.
Expand All @@ -128,7 +128,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
- Fixed potential crashes when executing transforms on multiple devices.
(https://github.com/ROCmSoftwarePlatform/rocFFT/issues/310)

## [rocFFT 1.0.10 for ROCm 4.1.0]
## rocFFT 1.0.10 for ROCm 4.1.0

### Added
- Explicitly specify MAX_THREADS_PER_BLOCK through _\_launch\_bounds\_ for all
Expand All @@ -154,7 +154,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
### Fixed
- Fixed 4x4x8192 accuracy failure.

## [rocFFT 1.0.8 for ROCm 3.10.0]
## rocFFT 1.0.8 for ROCm 3.10.0

### Optimizations
- Optimized 1D length 10000 C2C case.
Expand All @@ -166,7 +166,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
- Fixed correctness of SBCC/SBRC kernels with non-unit strides.
- Fixed fused C2R kernel when a Bluestein transform follows it.

## [rocFFT 1.0.7 for ROCm 3.9.0]
## rocFFT 1.0.7 for ROCm 3.9.0

### Optimizations
- New R2C and C2R fused kernels to combine pre/post processing steps with transpose.
Expand All @@ -181,7 +181,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
- Fixed 2D C2R transform with length 1 on one dimension.
- Fixed potential thread unsafety in logging.

## [rocFFT 1.0.6 for ROCm 3.8.0]
## rocFFT 1.0.6 for ROCm 3.8.0

### Optimizations
- Improved performance of 1D batch-paired R2C transforms of odd length.
Expand All @@ -196,7 +196,7 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
### Fixed
- Fixed correctness of certain large 2D sizes.

## [rocFFT 1.0.5 for ROCM 3.7.0]
## rocFFT 1.0.5 for ROCM 3.7.0

### Optimizations
- Optimized C2C power-of-2 middle sizes.
Expand Down

0 comments on commit 29da58b

Please sign in to comment.