You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was benchmarking this on my M4 Pro Macbook Pro (10 core) and I noticed that although I get benefits for some BLAS operations (matrix multiplications etc) SVD is a fair bit slower than libopenblas64. Here's the results I get on my machine:
AppleAccelerate
In [1]: using LinearAlgebra, BenchmarkTools, AppleAccelerate
In [2]: A =rand(ComplexF32, 2000, 2000);
In [3]: @btimesvd(A);
1.548 s (21 allocations: 169.04 MiB)
10 BLAS threads
In [1]: using LinearAlgebra, BenchmarkTools
In [2]: BLAS.get_num_threads()
Out[2]: 10
In [3]: A =rand(ComplexF32, 2000, 2000);
In [4]: @btimesvd(A);
841.862 ms (21 allocations: 169.04 MiB)
The text was updated successfully, but these errors were encountered:
I was benchmarking this on my M4 Pro Macbook Pro (10 core) and I noticed that although I get benefits for some BLAS operations (matrix multiplications etc) SVD is a fair bit slower than libopenblas64. Here's the results I get on my machine:
The text was updated successfully, but these errors were encountered: