Version 0.4.0

Pre-release

Pre-release

felipeZ released this 10 Feb 10:34

· 9 commits to master since this release

Offload the Eigen3 matrix-matrix multiplication to an Nvidia GPU using CUBLAS.

Changed

Split the memory management (CudaMatrix) from the CUBLAS invocation (CudaPipeline)
Moved all the allocation to the smart pointers inside CudaMatrix
Removed unused headers

Assets 2