Adds grid metrics to NetCDF output #2652

tomchor · 2022-07-12T21:29:05Z

This PR adds grid spacings to NetCDF out, which is a step closer to solving #1334

EDIT:

The goal here is to add grid metrics to NetCDF output. The are two main avenues to follow:

We can follow Oceananigans nomenclature and conventions, which would make the output play more nicely with Oceananigans itself (and more generally in the Julia environment).
We can follow standard community conventions, which would mean the output won't follow Oceananigans naming etc., but it would optimize its readability by other software.

I think we should follow option 2, since if a user wants to work with the output in Oceananigans/Julia, then using JLD2 output is probably the right choice anyway. Given that most people in the community use Python, xarray and xgcm to analyze model output, I think we should optimize the output to work with that ecosystem out of the box. Based on the discussion in #1334, it seems the preferred conventions to use are the SGRID conventions

For the more technical aspects, I'm planning on starting with RectilinearGrids and LatLonGrids in this PR since these are more straightforward. And then we can expand from there. I also think this should be presented to the user as an opt-in flag in NetCDFWriter constructor, as opposed to being included in every NetCDF output by default.

tomchor · 2022-07-12T21:43:15Z

Do we currently have a function to retrieve spacings that considers whether a cell is "wet" or not? Meaning it considers whether a cell is part of the interior of the domain or is a halo/immersed solid cell when calculating Δz? Or maybe there's a way to use abstract operation metrics to do this that I'm not aware?

For example in the script below, I can retrieve Δz directly from grid, but the edges do not reflect the fact that the z direction is bounded.

julia> using Oceananigans

julia> grid = RectilinearGrid(size = (4, 4, 4), x=(0,1), y=(0,1), z=0:0.25:1)
4×4×4 RectilinearGrid{Float64, Periodic, Periodic, Bounded} on CPU with 3×3×3 halo
├── Periodic x ∈ [0.0, 1.0) regularly spaced with Δx=0.25
├── Periodic y ∈ [0.0, 1.0) regularly spaced with Δy=0.25
└── Bounded  z ∈ [0.0, 1.0] variably spaced with min(Δz)=0.25, max(Δz)=0.25

julia> grid.Δzᵃᵃᶠ[1:grid.Nz+1]
5-element Vector{Float64}:
 0.25
 0.25
 0.25
 0.25
 0.25

To be in line with xgcm's standards for metrics the output in the above case (considering only the "wet" or interior or fluid-filled cells when calculating Δz) should be

julia> grid.Δzᵃᵃᶠ[1:grid.Nz+1]
5-element Vector{Float64}:
 0.125
 0.25
 0.25
 0.25
 0.125

since the first and last spacings on a bounded grid are "half inside the domain and half outside".

I looked for ways to do this in that are already in the code but couldn't find anything. I wanted to ask before I started coding something from scratch.

simone-silvestri · 2022-07-12T21:48:39Z

Hmmm, no we do not have that but we have the inactive_cell, inactive_node and peripheral_node functions, which kind of do that. (would be half if peripheral_node and zero if inactive_node)

glwagner · 2022-07-15T21:17:48Z

Hmmm, no we do not have that but we have the inactive_cell, inactive_node and peripheral_node functions, which kind of do that. (would be half if peripheral_node and zero if inactive_node)

I don't think this is generically true. This is only true for certain immersed boundaries --- GridFittedBottom and GridFittedBoundary.

glwagner · 2022-07-15T21:19:50Z

To be in line with xgcm/xgcm#306 (comment) the output in the above case (considering only the "wet" or interior or fluid-filled cells when calculating Δz) should be

julia> grid.Δzᵃᵃᶠ[1:grid.Nz+1]
5-element Vector{Float64}:
 0.125
 0.25
 0.25
 0.25
 0.125

since the first and last spacings on a bounded grid are "half inside the domain and half outside".

This is interesting and I was not aware of this standard. To be honest, I do not understand the rationale for the standard that "chops" the face to face spacing at the boundary. I'm not sure this convention reflects a correct understanding of the staggered grid and it's interaction with the boundary. This does become functionally important when calculating gradients across the boundary (and when halo cells are used to enforce gradient or value boundary conditions). But perhaps I am missing the purpose of the convention?

glwagner · 2022-07-15T21:23:04Z

More generally though, we do need to design a function-based user interface for extracting grid metrics from any grid. This does not exist and it's not sustainable to access grid properties directly by writing things like grid.Δzᵃᵃᶠ. This method will also produce incorrect results for immersed boundaries that modify grid metrics, such as PartialCellBottom and a hypothetical cut-cell implemenation.

glwagner · 2023-03-22T16:53:19Z

Perhaps we are ready to revisit this?

tomchor · 2023-03-22T17:47:09Z

Perhaps we are ready to revisit this?

Yes, I was waiting for us to merge #2842. Let's keep this open

…l into tc/netcdf-distances

glwagner · 2023-04-20T15:09:25Z

src/Grids/grid_utils.jl

+                                                                   ynode(i,j,k, grid, ℓx, ℓy, ℓz)           - ynode(i,j-1,k, grid, ℓx, flip_loc(ℓy), ℓz))
+
+active_zspacing_at_boundary(i,j,k,grid, ℓx, ℓy, ℓz) = ifelse(k==1, znode(i,j,k, grid, ℓx, ℓy, flip_loc(ℓz)) - znode(i,j,k, grid, ℓx, ℓy, ℓz),
+                                                                   znode(i,j,k, grid, ℓx, ℓy, ℓz)           - znode(i,j,k-1, grid, ℓx, ℓy, flip_loc(ℓz)))


What do these do?

These implement the behavior defined here. I'm not sure if that's a good name for that function, but whatever we call it, I'm going to make sure it has a helpful docstring before we merge (if in fact that function survives). This I believe (I'm confirming it here) is the proper way to define spacings in a staggered grid following SGRID conventions.

src/Grids/grid_utils.jl

glwagner · 2023-04-20T15:10:41Z

Before writing too much code can we discuss what we need and come up with a design together?

tomchor · 2023-04-20T15:17:44Z

Before writing too much code can we discuss what we need and come up with a design together?

Yes I agree. I am just pushing some code that I did for my own use as a starting point. I'll post a discussion starter soon.

tomchor · 2023-04-20T16:14:33Z

Now that the spacing functions are in place, I'm going to start working on this. (I'll update the top post with the information below.)

The goal here is to add grid metrics to NetCDF output. The are two main avenues to follow:

We can follow Oceananigans nomenclature and conventions, which would make the output play more nicely with Oceananigans itself (and more generally in the Julia environment).
We can follow standard community conventions, which would mean the output won't follow Oceananigans naming etc., but it would optimize its readability by other software.

I think we should follow option 2, since if a user wants to work with the output in Oceananigans/Julia, then using JLD2 output is probably the right choice anyway. Given that most people in the community use Python, xarray and xgcm to analyze model output, I think we should optimize the output to work with that ecosystem out of the box. Based on the discussion in #1334, it seems the preferred conventions to use are the SGRID conventions

For the more technical aspects, I'm planning on starting with RectilinearGrids and LatLonGrids in this PR since these are more straightforward. And then we can expand from there. I also think this should be presented to the user as an opt-in flag in NetCDFWriter constructor, as opposed to being included in every NetCDF output by default.

tomchor · 2023-04-20T16:22:47Z

@jbusecke I hope you don't mind me recruiting your quick help here! I have a couple of questions regarding SGRID and xgcm, just to make sure I'm understanding things right:

Do things still work like your comment here describes? Meaning the metrics are still describing properties surrounding the point?

Also (related), does the treatment of spacings around boundary points still follow what you described here? (And what I tried to illustrate here for Oceananigans?)

Given these two points, can I build areas and volumes just by doing the normal operations with the correctly-defined spacings in each point?

Thanks!

glwagner · 2023-04-20T16:55:51Z

Do we currently have a function to retrieve spacings that considers whether a cell is "wet" or not?

inactive_node(i, j, k, grid, lx, ly, lz)

returns true if a cell is immersed / inactive, and false if a cell is active.

The boundary points are special, because their status depends on whether they are a prognostic or diagnostic field. Therefore we have another function,

peripheral_node(i, j, k, grid, lx, ly, lz)

which returns true if a cell is inactive or if it lies on the boundary between active and inactive (ie, the "periphery" of the domain). This distinction is only meaningful for Face-centered fields: Center locations cannot lie on the boundary.

julia> using Oceananigans

julia> grid = RectilinearGrid(size = (4, 4, 4), x=(0,1), y=(0,1), z=0:0.25:1)
4×4×4 RectilinearGrid{Float64, Periodic, Periodic, Bounded} on CPU with 3×3×3 halo
├── Periodic x ∈ [0.0, 1.0) regularly spaced with Δx=0.25
├── Periodic y ∈ [0.0, 1.0) regularly spaced with Δy=0.25
└── Bounded  z ∈ [0.0, 1.0] variably spaced with min(Δz)=0.25, max(Δz)=0.25

julia> grid.Δzᵃᵃᶠ[1:grid.Nz+1]
5-element Vector{Float64}:
 0.25
 0.25
 0.25
 0.25
 0.25

To be in line with xgcm's standards for metrics the output in the above case (considering only the "wet" or interior or fluid-filled cells when calculating Δz) should be

julia> grid.Δzᵃᵃᶠ[1:grid.Nz+1]
5-element Vector{Float64}:
 0.125
 0.25
 0.25
 0.25
 0.125

since the first and last spacings on a bounded grid are "half inside the domain and half outside".

Are you sure? For immersed boundaries, this would mean that all the metrics must be 3D arrays even when the grid directions are separable --- greatly inflating the size of your output file.

Or maybe there's a way to use abstract operation metrics to do this that I'm not aware?

It is simple. Write a KernelFunctionOperation that checks if a node is peripheral_node. If true, divide the metric by 2, otherwise, return the metric unchanged.

However, as noted above, I'm not sure this is what you want.

Perhaps there is a purpose to the convention that divides the cell sizes on the boundary by 2. However, I'm worried this could be misleading regarding how the staggered finite volume grid and its diagnostics are interpreted. Another concern is that modifying metrics prior to output will lead to difficulties in reproducible diagnostics in the future between xgcm computations and native Oceanangians computations. So in this case I hope we can keep things simple and simply save whatever xspacings, etc outputs, if possible.

ali-ramadhan · 2024-11-15T01:12:37Z

Now that #3143 has been merged, maybe developing and merging this PR an easier job. Do we just pass the KernelFunctionOperators from xspacings, yspacings, etc. and the correct metrics are compute!ed and outputted?

@tomchor Only 4 merge conflicts! But I was wondering if you could help resolve the conflicts since it's your code.

tomchor · 2024-11-15T16:07:28Z

I think conflicts are properly resolved, but not sure. Let's see what happens to the tests.

tomchor · 2025-04-14T12:29:38Z

Superseded by #4046

added default dimension attributes for spacings

20216ab

navidcy added the output 💾 label Jul 13, 2022

tomchor added 2 commits August 22, 2022 17:56

Merge branch 'main' into tc/netcdf-distances

da09453

Merge branch 'main' into tc/netcdf-distances

a8d70da

tomchor mentioned this pull request Feb 25, 2023

Creates functions for grid spacings #2842

Merged

Merge branch 'main' into tc/netcdf-distances

ec36123

tomchor mentioned this pull request Mar 22, 2023

Integrating Oceananigans with Python-xgcm #1334

Closed

tomchor added 5 commits April 16, 2023 16:37

Merge branch 'main' into tc/netcdf-distances

db58ac8

add active_x/y/zspacing functions

ebacc69

add grid_metric_output_netcdf.jl

a6203b6

Merge branch 'main' into tc/netcdf-distances

30f467f

Merge branch 'tc/netcdf-distances' of github.com:CliMA/Oceananigans.j…

ec1e035

…l into tc/netcdf-distances

glwagner reviewed Apr 20, 2023

View reviewed changes

src/Grids/grid_utils.jl Outdated Show resolved Hide resolved

tomchor marked this pull request as draft April 20, 2023 15:10

flip_loc -> flip

b8b8059

tomchor changed the title ~~Adds grid spacings to NetCDF output~~ Adds grid metrics to NetCDF output Apr 20, 2023

tomchor mentioned this pull request Jul 15, 2023

(0.94.0) Overhaul spacings functions to return KernelFunctionOperations #3143

Merged

Merge branch 'main' into tc/netcdf-distances

14b0e2c

ali-ramadhan mentioned this pull request Sep 13, 2024

Support curvilinear grids with NetCDFOutputWriter? #2248

Closed

ali-ramadhan mentioned this pull request Nov 13, 2024

MultiRegion adaptation of the NonhydrostaticModel #2795

Closed

ali-ramadhan mentioned this pull request Nov 15, 2024

Use FieldTimeSeries for analysis? tomchor/Oceanostics.jl#181

Open

tomchor added 2 commits November 15, 2024 13:04

Merge branch 'main' into tc/netcdf-distances

021e8a8

Delete validation/turbulence_closures/free_convection_smagorinsky.jl

7d9d1a4

ali-ramadhan mentioned this pull request Jan 16, 2025

(0.96.0) Improve the NetCDFOutputWriter experience #4046

Merged

5 tasks

tomchor closed this Apr 14, 2025

glwagner deleted the tc/netcdf-distances branch April 14, 2025 16:29

Adds grid metrics to NetCDF output #2652

Adds grid metrics to NetCDF output #2652

Uh oh!

Conversation

tomchor commented Jul 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tomchor commented Jul 12, 2022

Uh oh!

simone-silvestri commented Jul 12, 2022

Uh oh!

glwagner commented Jul 15, 2022

Uh oh!

glwagner commented Jul 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

glwagner commented Jul 15, 2022

Uh oh!

glwagner commented Mar 22, 2023

Uh oh!

tomchor commented Mar 22, 2023

Uh oh!

glwagner Apr 20, 2023

Choose a reason for hiding this comment

Uh oh!

tomchor Apr 20, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

glwagner commented Apr 20, 2023

Uh oh!

tomchor commented Apr 20, 2023

Uh oh!

tomchor commented Apr 20, 2023

Uh oh!

tomchor commented Apr 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

glwagner commented Apr 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ali-ramadhan commented Nov 15, 2024

Uh oh!

tomchor commented Nov 15, 2024

Uh oh!

tomchor commented Apr 14, 2025

Uh oh!

Uh oh!

tomchor commented Jul 12, 2022 •

edited

Loading

glwagner commented Jul 15, 2022 •

edited

Loading

tomchor commented Apr 20, 2023 •

edited

Loading

glwagner commented Apr 20, 2023 •

edited

Loading