Skip to content

Rocsparse add targeting OLCF Frontier #716

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 8 commits into from
Jun 12, 2023
Merged

Rocsparse add targeting OLCF Frontier #716

merged 8 commits into from
Jun 12, 2023

Conversation

mewall
Copy link
Collaborator

@mewall mewall commented Jun 12, 2023

Working rocsparse add and multiply fix

o New bml_add_ellpack() method using rocsparse
o New bml_sort_rocsparse_ellpack() method to sort csr column indices
o New bml_prune_rocsparse_ellpack() method to threshold the csr matrix
o Modify bml_multiply_ellpack() rocpsarse to use the new sort and prune
   - Addresses issue where unsorted array might be passed, potentially causing errors
o Progress benchmarks produce correct results using rocsparse

mewall added 6 commits May 10, 2023 11:40
o New bml_sort_rocsparse_ellpack() method to sort csr column indices
o New bml_prune_rocsparse_ellpack() method to threshold the csr matrix
o Modify bml_add_ellpack() and bml_multiply_ellpack() to use these
methods
o Progress benchmarks now produce correct results with rocsparse
@mewall
Copy link
Collaborator Author

mewall commented Jun 12, 2023 via email

@@ -21,6 +21,13 @@
#ifdef BML_USE_CUSPARSE
#include <cusparse.h>
#endif
#ifdef BML_USE_ROCSPARSE
// Copy rocsparse headers into src/rocsparse/ and edit rocsparse_functions.h to remove '[[...]]' text
#include "../rocsparse/rocsparse.h"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks like this file is missing...

@mewall
Copy link
Collaborator Author

mewall commented Jun 12, 2023 via email

@jeanlucf22
Copy link
Collaborator

So we are expecting the user to do that? why not add this modified file to the repo?

@mewall
Copy link
Collaborator Author

mewall commented Jun 12, 2023 via email

@jeanlucf22
Copy link
Collaborator

/ccs/proj/csc304

Let's discuss. Personally I don't think it's a good idea to redistribute an edited version of AMD's code in BML, but there should be other ways to handle this. For example, the olcf consultants might be willing to patch the relevant rocm header on frontier/crusher, or maybe we can write a script to copy the headers to the right place and patch the relevant one. I was hoping AMD would fix this much sooner, I reported the issue way back in rocm 4.X...however, it is fixed in rocm 5.5 so at least it won't be an issue soon. Meanwhile I'd like to keep this PR focused on the add/multiply methods, and make this a separate issue, addressed in a future PR. The build has worked this way for a while.

________________________________ From: Jean-Luc Fattebert @.> Sent: Monday, June 12, 2023 2:06:29 PM To: lanl/bml Cc: Wall, Michael E; Author Subject: [EXTERNAL] Re: [lanl/bml] Rocsparse add targeting OLCF Frontier (PR #716) So we are expecting the user to do that? why not add this modified file to the repo? — Reply to this email directly, view it on GitHub<https://urldefense.com/v3/https://github.com/lanl/bml/pull/716*issuecomment-1588012478;Iw!!Bt8fGhp8LhKGRg!FoapLQMzN6RAilCBhA_djFyhCKzF_1ZetGrCRutSj42GJsXaUblDaxm1ZnhSMahjnRQ__lvembCthdH4hSZuEhQi$>, or unsubscribe<https://urldefense.com/v3/https://github.com/notifications/unsubscribe-auth/AA67VEPXK7BT22R7PR4YEU3XK5ZELANCNFSM6AAAAAAZDWRBGQ;!!Bt8fGhp8LhKGRg!FoapLQMzN6RAilCBhA_djFyhCKzF_1ZetGrCRutSj42GJsXaUblDaxm1ZnhSMahjnRQ__lvembCthdH4hbstSxkJ$>. You are receiving this because you authored the thread.Message ID: @.>

How about putting a copy of that file in our project shared directory at OLCF? /ccs/proj/csc304

@jeanlucf22 jeanlucf22 merged commit 74d150d into master Jun 12, 2023
@jeanlucf22 jeanlucf22 deleted the rocsparse_add branch June 12, 2023 22:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants