We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
terminating with uncaught exception of type mperf::MperfError: Failed to get a file descriptor for ASE_SPEC
The text was updated successfully, but these errors were encountered:
可能是没有PMU event的采样权限,如果能切换root用户可以用root试试,或者看有没有权限修改/proc/sys/kernel/perf_event_paranoid里面的值为-1
Sorry, something went wrong.
感谢解答,已经解决了,还有个问题想请教下,矩阵乘优化分块部分,根据L1D Cache的大小结合mr和nr计算出Kr,Kr=L1D/(mr+nr),L1D=32KB,mr+nr=20,最后怎么算出Kr=256?
抱歉,估计是当初算的人没有拿到测试手机上准确的L1 cache size,不可考证了。这里出入没有特别大,因为文档没说清楚还需要考虑sizeof(dtype),测试用的float数据类型,所以还需要除以4
No branches or pull requests
terminating with uncaught exception of type mperf::MperfError: Failed to get a file descriptor for ASE_SPEC
The text was updated successfully, but these errors were encountered: