NetCDF output error on Kestrel GPUs #1551

Open
sbidadi9 opened this issue Apr 4, 2025 · 3 comments
Labels
bug:amr-wind Something isn't working no-issue-activity

Comments

@sbidadi9
Contributor

sbidadi9 commented Apr 4, 2025

Bug description

When running on more than one GPU node, writing AMR-Wind boundary data in NetCDF format generates errors. Below is the output after one iteration of a multiphase simulation:

Image

This issue does not occur when running the same case on a single GPU node:

Image

AMR-Wind information

==============================================================================
AMR-Wind (https://github.com/exawind/amr-wind)

AMR-Wind version :: v3.4.0-30-gfe3799a1
AMR-Wind Git SHA :: fe3799a
AMReX version :: 25.02-23-g06b4a5b105f5

Exec. time :: Thu Apr 3 22:56:34 2025
Build time :: Apr 2 2025 09:50:56
C++ compiler :: NVHPC 23.9.0

MPI :: ON (Num. ranks = 4)
GPU :: ON (Backend: CUDA)
OpenMP :: OFF

Enabled third-party libraries:
NetCDF 4.9.2

@sbidadi9 sbidadi9 added the bug:amr-wind Something isn't working label Apr 4, 2025
@jrood-nrel
Contributor

Can you try running it with `export CUDA_LAUNCH_BLOCKING=1` set?
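For anyone reproducing this, a minimal sketch of what that could look like in a job script. `CUDA_LAUNCH_BLOCKING=1` makes CUDA kernel launches synchronous, so an error is reported at the launch that caused it rather than at a later, unrelated call. The launch line is left commented out because the node count, executable path, and input file name shown are assumptions, not taken from this issue; only the 4 MPI ranks come from the build information above.

```shell
# Make CUDA kernel launches synchronous so failures surface at the offending call.
export CUDA_LAUNCH_BLOCKING=1

# Confirm the variable is visible to child processes before launching.
echo "CUDA_LAUNCH_BLOCKING=${CUDA_LAUNCH_BLOCKING}"

# Hypothetical multi-node launch (node count, binary, and input file are placeholders):
# srun -N 2 -n 4 ./amr_wind input.inp
```

Runs with this setting are expected to be slower, so it is usually worth enabling only for the failing multi-node case.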

@sbidadi9
Contributor Author

sbidadi9 commented Apr 4, 2025

@jrood-nrel I will run the case again with the above setting.

github-actions bot commented May 5, 2025

This issue is stale because it has been open 30 days with no activity.
