[251] Merge new IO class #469

tjhunter · 2025-07-07T15:56:42Z

Description

Pulls the changes of @grassesi and @tjhunter into the dev branch.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update

Issue Number

It also includes #456 in order to test it.

Closes #251

* Implement mock IO * Adapt score class

* changes * changes

* Rebased to the latest changes and linted new changes * addressed review comments * addressed review comments * Linted the latest changes. * corrected the formating * corrected the formating * configured ruff to use LF line endings in pyproject.toml

* working * changes * removing deps from non-core project * changes * fixes * comments

* remove database folder * fix database

* Simplifying workflow for plot_training * Ruffed * Working on implementing exclude_source * Remove unused code * Fixed ruff issue

* Fixing bug in lat handling * Added comment --------- Co-authored-by: Seb Hickman <[email protected]>

* recover num_ranks from previous run to calculate epoch_base * set email settings for commits * addressing Tim's comment * make ruff happy * improve style

Linter rule so np.ndarray is not used as type

#376) * changed the script name from evaluate to inference as it simply generate infer samples * changed evaluate to inference in the main scripts and corresponding calls in the config * update the main function for the inference script * changed evaluate to inference also in docstring, unit test scripts, and integration test scripts --------- Co-authored-by: Patnala,Ankit <[email protected]>

* Exclude channels from src / target * Simplified code and added comment that pattern matching is used * Adding new stream config * Fixing bug that led to error when accessing self.ds when dataset is empty * Wokign on exlcude_source * work in progress * Fixing incorrect formating for logger (#388) * Ruffed * Refactored and cleaned up channel selection. Also added check that channels are not empty * Cleaned channel parsing and selection * Adjustments * Removing asserts incompatible with empty dataset --------- Co-authored-by: Christian Lessig <[email protected]>

* chanegs * mistake * mistake * mistake * changes * doc

* creating masking class and adapting tokenizer_masking to use this class * minor changes to masking.py and tokenizer_masking * removed old tokenizer_masking * include masking_strategy in default_config * change ValueError to assert * linting formatting changes files * further linting of docstrings * create mask_source and mask_target in Masker, and update tokenizer_masking to use these, then style improvements * linted masking, tokenizer_masking * modify masker, rng and perm_sel now part of class, remove extra masking_rate, update comments, remove archived class * remove check if all masked, not masked * remove self.masking_rate from MultiStreamDS class, and masking args from batchify_source * update tokenizer utils with description of idx_ord_lens in comment * remove masking args from batchify_, perm_sel removed now internal to Masker class, remove handling special cases of masking (all masked) * adding masking_strategy: to config * remove unused mentions of masking_combination * removed comment about streams * changed assert to check self perm_sel is not None * ruff masking, tokenizer_masking * Ruffed * Added warning to capture corner case, likely due to incorrect user settings. * Fixed incorrect call twice * Fixed missing conditional for logger statement * Required changes for better handling of rngs * Improved handling of rngs * Improved handling of rng --------- Co-authored-by: Christian Lessig <[email protected]>

* Fix bug with seed being divided by 0 for worker ID=0 * Fix bug causing crash when secrets aren't in private config * Implement logging losses per channel * Fix issue with empty targets * Rework loss logging * ruff * Remove computing max_channels * Change variables names * ruffed * Remove redundant enumerations * Use stages for logging * Add type hints * Apply the review * ruff * fix * Fix type hints * ruff --------- Co-authored-by: Tim Hunter <[email protected]>

* changes * fixes

tjhunter · 2025-07-08T12:06:11Z

Verbally by @clessig

* Implement mock IO (ecmwf#336) * Adapt score class score class (ecmwf#339) * Implement mock IO * Adapt score class * Removing unused file (ecmwf#349) * remove database folder (ecmwf#355) * Small change - CI - pinning the version of formatting (ecmwf#361) * changes * changes * Update INSTALL.md * Update INSTALL.md * Fixed Exxx lint issues (ecmwf#284) * Rebased to the latest changes and linted new changes * addressed review comments * addressed review comments * Linted the latest changes. * corrected the formating * corrected the formating * configured ruff to use LF line endings in pyproject.toml * [357] Sub-package for evaluation (ecmwf#359) * working * changes * removing deps from non-core project * changes * fixes * comments * Iluise quick fix stac (ecmwf#374) * remove database folder * fix database * Simplifying workflow for plot_training (ecmwf#368) * Simplifying workflow for plot_training * Ruffed * Working on implementing exclude_source * Remove unused code * Fixed ruff issue * Fixing bug in lat handling (377) (ecmwf#378) * Fixing bug in lat handling * Added comment --------- Co-authored-by: Seb Hickman <[email protected]> * recover num_ranks from previous run to calculate epoch_base (ecmwf#317) * recover num_ranks from previous run to calculate epoch_base * set email settings for commits * addressing Tim's comment * make ruff happy * improve style * changes (ecmwf#385) Linter rule so np.ndarray is not used as type * changed the script name from evaluate to inference as it simply gener… (ecmwf#376) * changed the script name from evaluate to inference as it simply generate infer samples * changed evaluate to inference in the main scripts and corresponding calls in the config * update the main function for the inference script * changed evaluate to inference also in docstring, unit test scripts, and integration test scripts --------- Co-authored-by: Patnala,Ankit <[email protected]> * Introduce tuples instead for strings to avoid TypeError (ecmwf#392) * Exclude channels from src / target (ecmwf#363) * Exclude channels from src / target * Simplified code and added comment that pattern matching is used * Adding new stream config * Fixing bug that led to error when accessing self.ds when dataset is empty * Wokign on exlcude_source * work in progress * Fixing incorrect formating for logger (ecmwf#388) * Ruffed * Refactored and cleaned up channel selection. Also added check that channels are not empty * Cleaned channel parsing and selection * Adjustments * Removing asserts incompatible with empty dataset --------- Co-authored-by: Christian Lessig <[email protected]> * add embed_dropout_rate to config v1 (ecmwf#358) * [402] adds checks to the pull request (ecmwf#403) * chanegs * mistake * mistake * mistake * changes * doc * Introduce masking class and incorporate in TokenizerMasking (ecmwf#383) * creating masking class and adapting tokenizer_masking to use this class * minor changes to masking.py and tokenizer_masking * removed old tokenizer_masking * include masking_strategy in default_config * change ValueError to assert * linting formatting changes files * further linting of docstrings * create mask_source and mask_target in Masker, and update tokenizer_masking to use these, then style improvements * linted masking, tokenizer_masking * modify masker, rng and perm_sel now part of class, remove extra masking_rate, update comments, remove archived class * remove check if all masked, not masked * remove self.masking_rate from MultiStreamDS class, and masking args from batchify_source * update tokenizer utils with description of idx_ord_lens in comment * remove masking args from batchify_, perm_sel removed now internal to Masker class, remove handling special cases of masking (all masked) * adding masking_strategy: to config * remove unused mentions of masking_combination * removed comment about streams * changed assert to check self perm_sel is not None * ruff masking, tokenizer_masking * Ruffed * Added warning to capture corner case, likely due to incorrect user settings. * Fixed incorrect call twice * Fixed missing conditional for logger statement * Required changes for better handling of rngs * Improved handling of rngs * Improved handling of rng --------- Co-authored-by: Christian Lessig <[email protected]> * Implement per-channel logging (ecmwf#283) * Fix bug with seed being divided by 0 for worker ID=0 * Fix bug causing crash when secrets aren't in private config * Implement logging losses per channel * Fix issue with empty targets * Rework loss logging * ruff * Remove computing max_channels * Change variables names * ruffed * Remove redundant enumerations * Use stages for logging * Add type hints * Apply the review * ruff * fix * Fix type hints * ruff --------- Co-authored-by: Tim Hunter <[email protected]> * [346] Passing options through the slurm script (ecmwf#400) * changes * fixes * refactor `validation_io.write_validation` to make it more readable * remove legacy code `validation_io.read_validation` * encapsulate artifact path logic in config module * remove redundant attribute `Trainer.path_run` * use config to look up base_path in `write_validation` * remove unused `write_validation` args: `base_path`, `rank` * ensure correct type for pathes * remove streams initialization from `Trainer` * remove path logic from `Trainer.save_model` * simplify conditional * rename mock io module * update uv to include dask * Implement io module to support reading/writing model output * implement new validation_io routine * use new write_validation routine * remove unused code * rename output routine to `write_output` * ruffed and added comments * fixed annotation * use simple __init__ method for `OutputItem` instead of dataclasses magic * address reviewers comments * rename method * add simple docstrings * ruffed * typehint fixes * refactor names * update comments and typehints, dont import pytorch * remove `__post_init__` methods, cache properties * fixes and integration test * final fixes :) * changes * changes * changes * changes * changes * more work * changes * changes * changes * ruffed * ruffed * improve logging and comments * Update to score-class according to internal discussions and feedback in PR. * Add license header. * Ruffed code. * Update to score-class according to internal discussions and feedback in PR. * Add license header. * Ruffed code. * Add doc-string to call-method and provide example usage for efficient graph-construction. * Some fixes to score-class. * Some fixes to handling aggregation dimension. * Add missing import of MockIO. * changes * changes * removing the scores * changes * changes * changes * changes * changes * changes * changes * changes * changes * changes * changes * changes * changes * changes --------- Co-authored-by: Kacper Nowak <[email protected]> Co-authored-by: Christian Lessig <[email protected]> Co-authored-by: iluise <[email protected]> Co-authored-by: Sindhu-Vasireddy <[email protected]> Co-authored-by: Seb Hickman <[email protected]> Co-authored-by: Julian Kuehnert <[email protected]> Co-authored-by: ankitpatnala <[email protected]> Co-authored-by: Patnala,Ankit <[email protected]> Co-authored-by: Savvas Melidonis <[email protected]> Co-authored-by: Christian Lessig <[email protected]> Co-authored-by: Till Hauer <[email protected]> Co-authored-by: Simon Grasse <[email protected]> Co-authored-by: Michael <[email protected]>

kacpnowak and others added 30 commits June 16, 2025 12:21

Implement mock IO (#336)

4767a18

Adapt score class score class (#339)

6709f5b

* Implement mock IO * Adapt score class

Removing unused file (#349)

ff6609f

remove database folder (#355)

8ab2a00

Small change - CI - pinning the version of formatting (#361)

78a3447

* changes * changes

Update INSTALL.md

a6a6e33

Update INSTALL.md

7cf7560

Fixed Exxx lint issues (#284)

98e91ab

* Rebased to the latest changes and linted new changes * addressed review comments * addressed review comments * Linted the latest changes. * corrected the formating * corrected the formating * configured ruff to use LF line endings in pyproject.toml

[357] Sub-package for evaluation (#359)

dfd3461

* working * changes * removing deps from non-core project * changes * fixes * comments

Iluise quick fix stac (#374)

a9f343d

* remove database folder * fix database

Simplifying workflow for plot_training (#368)

f4b73ab

* Simplifying workflow for plot_training * Ruffed * Working on implementing exclude_source * Remove unused code * Fixed ruff issue

Fixing bug in lat handling (377) (#378)

b195152

* Fixing bug in lat handling * Added comment --------- Co-authored-by: Seb Hickman <[email protected]>

recover num_ranks from previous run to calculate epoch_base (#317)

3b0fbd6

* recover num_ranks from previous run to calculate epoch_base * set email settings for commits * addressing Tim's comment * make ruff happy * improve style

changes (#385)

30882f5

Linter rule so np.ndarray is not used as type

Merge branch 'develop' into grassesi/dev/hackathon_evaluation

627995b

Introduce tuples instead for strings to avoid TypeError (#392)

5856f8f

add embed_dropout_rate to config v1 (#358)

5850cc2

[402] adds checks to the pull request (#403)

7d68271

* chanegs * mistake * mistake * mistake * changes * doc

[346] Passing options through the slurm script (#400)

7e53b86

* changes * fixes

refactor validation_io.write_validation to make it more readable

924dbfa

remove legacy code validation_io.read_validation

1dee399

encapsulate artifact path logic in config module

7a0321f

remove redundant attribute Trainer.path_run

7feb7af

use config to look up base_path in write_validation

9301fef

remove unused write_validation args: base_path, rank

f5c3e00

ensure correct type for pathes

0293665

mlangguth89 and others added 6 commits July 4, 2025 15:50

Add missing import of MockIO.

c0b8d90

merge with dev

2b6272a

Merge remote-tracking branch 'origin/develop' into tjh/dev/issue_251_2

2bd1c4a

changes

bca96ce

Merge branch 'develop' into tjh/dev/issue_251_2

36ae050

changes

beea29c

tjhunter marked this pull request as draft July 7, 2025 15:59

tjhunter changed the title ~~[251] Merge new IO class~~ [251] DRAFT - Merge new IO class Jul 7, 2025

tjhunter added 16 commits July 7, 2025 18:00

removing the scores

ef916b0

changes

24877c9

changes

4d05fbf

changes

4fbe29f

changes

c62e837

changes

5850e48

changes

29b2e50

changes

660e2ad

changes

ab66d0b

changes

46e45fd

Merge remote-tracking branch 'origin/develop' into tjh/dev/issue_251_2

0d83e22

changes

b66732e

changes

0b4accb

changes

f75ba3a

changes

f4f5bac

changes

60661b2

tjhunter changed the title ~~[251] DRAFT - Merge new IO class~~ [251] Merge new IO class Jul 8, 2025

tjhunter marked this pull request as ready for review July 8, 2025 12:06

tjhunter merged commit eab4b14 into develop Jul 8, 2025
3 checks passed

tjhunter mentioned this pull request Jul 8, 2025

Sgrasse/develop/issue 251 #365

Open

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[251] Merge new IO class #469

[251] Merge new IO class #469

Uh oh!

tjhunter commented Jul 7, 2025

Uh oh!

tjhunter commented Jul 8, 2025

Uh oh!

Uh oh!

Uh oh!

[251] Merge new IO class #469

[251] Merge new IO class #469

Uh oh!

Conversation

tjhunter commented Jul 7, 2025

Description

Type of Change

Issue Number

Uh oh!

tjhunter commented Jul 8, 2025

Uh oh!

Uh oh!

Uh oh!