v1.0.3
• Remove glog dependency
• Add Docker and Conda files; update PyPI packages
• Improve ideep enabling condition to avoid a potential performance drop in the backward pass
• Fix bug in split_axis
• Add multi_add interface
• Replace omp parallel with omp simd for mlsl