v2.0.0_b1
1.Improve iDeep python package ReLU API to support Leaky-ReLU.
2.Improve iDeep python package BatchNormalization and Pooling2D API for better performance and precision.
3.Avoid AVX-SSE transition penalties to improve NMT performance about xxx%.
4.Bugfix for AlexNet training divergence.
5.Bugfix for AlexNet/VGG/SSD 3-5 percentage drop from SOTA.
6.Improve iDeep tensor(mdarray) compatibility.