matrixvecmult executes kernels MatrixVectorMul1n which compute c = A * b where A is stored in row major form and c and b are vectors Using device ATI Radeon HD 6770M CPU took 0.009391 s Testing MatrixVectorMul1 WorkGroupSize = 64 GlobalSize 100032 Finished kernel execution Average kernel execution time 0.0240784 Found 0 different entries Testing MatrixVectorMul2 WorkGroupSize = 64 GlobalSize 3840 Finished kernel execution Average kernel execution time 0.0234127 Found 0 different entries Testing MatrixVectorMul3 WorkGroupSize = 64 GlobalSize 3840 Finished kernel execution Average kernel execution time 0.11509 Found 84943 different entries 0 entries more than 0.001% Testing MatrixVectorMul4 WorkGroupSize = 64 GlobalSize 3840 Finished kernel execution Average kernel execution time 0.0267073 Found 81490 different entries 0 entries more than 0.001% Testing MatrixVectorMul5 WorkGroupSize = 64 GlobalSize 3840 Finished kernel execution Average kernel execution time 0.0245024 Found 81430 different entries 0 entries more than 0.001%