clBLAS  2.0
OpenCL BLAS

This is an implementation of Basic Linear Algebra Subprograms, levels 1, 2 and 3 using OpenCL and optimized for the AMD GPU hardware.