Description
Is your feature request related to a problem? Please describe.
Following Watt-Meyer et al., 2023, 2024; ACE & ACE2. Variables are normalized using a residual scaling approach such that predicting outputs equal to input would result in each variable contributing equally to the loss function.
This should be straightforward, since anemoi-datasets
already computes the tendencies statistics needed for this normalization strategy. However, the reference formulas (Appendix H) use the standard deviation of the mean-std normalized fields (not the unnormalized fields, which is what anemoi-datasets
actually computes these statistics for).
Describe the solution you'd like
See our proposed solution. This is an implementation of the reference formulas (Appendix H), using the tendencies statistics computed by anemoi-datasets
. We reworked the reference formulas so that the statistics of the unnormalized fields are used instead:
Let anemoi-datasets
actually computes the tendencies statistics for). Thus, the residual scaling consists of adding
In our implementation, the geometric mean for loop
, and multiplied afterwards to both stdev
doesn't exist in the statistics
dictionary (because the tendencies statistics weren't computed during the creation of the dataset), then the code fallbacks to using the stdev
instead, in which case the formulae above reduces to a mean-std normalization.
Additionally, we added the tendencies' stdev
(and its ratio to the stdev
) in the inspect
command in anemoi-datasets
.
Describe alternatives you've considered
No response
Additional context
No response
Organisation
Predictia Intelligent Data Solutions - DestinationEarth393
Metadata
Metadata
Assignees
Labels
Type
Projects
Status