Releases · intel/ideep

1.Improve iDeep python package ReLU API to support Leaky-ReLU.
2.Improve iDeep python package BatchNormalization and Pooling2D API for better performance and precision.
3.Avoid AVX-SSE transition penalties to improve NMT performance about xxx%.
4.Bugfix for AlexNet training divergence.
5.Bugfix for AlexNet/VGG/SSD 3-5 percentage drop from SOTA.
6.Improve iDeep tensor(mdarray) compatibility.

Assets 2

03 May 06:36

mingxiaoh

v2.0.0_pr1

a861d8c

v2.0.0_pr1: Update README Pre-release

Pre-release

• Implement simple and general stateless C++ API to improve programmable.
• Implement CNN primitive computation LRU, scratch allocator and self-adapted data format optimization to maximize MKLDNN performance.
• Integrate iDeep python package (ideep4py) with stateless C++ API.

Assets 2

13 Apr 10:05

mingxiaoh

v1.0.4

b20490e

v1.0.4

• Update conda and docker files
• Improve iDeep MDarray compatibility about array reshape & python buffer protocol.
• Update mkldnn version to '464c268e544bae26f9b85a2acb9122c766a4c396'
• Restrict upper limit of build process to the number of CPUs(thanks@tkng)

Assets 2

06 Mar 03:30

mingxiaoh

v1.0.3

d33ac3e

v1.0.3

• Remove glog dependence
• Add docker and Conda files, update PYPI packages
• Improve ideep enabling condition to avoid potential performance drop in backward.
• Fix bug in split_axis
• Add interface of multi_add
• Replace omp parallel with omp simd for mlsl

Assets 2

12 Feb 08:14

opencici2006

v1.0.2

68ca869

v1.0.2

• Update test cases.
• Specify NumPy dependency version (NumPy 1.13 for now).
• Fix default parameter bug in sum along axis
• Fix bug in concat and split axis.

Assets 2