Open
Description
So, As these lines shows, we get the initElapsedS & readElapsedS from the difference of each other. Is't a mistake or something meaningful I haven't understood?
Otherwise, I get the results on H800 using another closed-source NV-STREAM tool. It seems that it provided better bandwidth performance result compared with BabelStream because of the optimized block size parameters. What's more, it also and show Read & Write results. Could I take the Init_kernel as the Write result and read_arrays as Read result in BabelStream?
Metadata
Metadata
Assignees
Labels
No labels