|
1 | 1 | # Changelog
|
2 | 2 |
|
3 | 3 | <!-- Next changelog -->
|
| 4 | +## NVIDIA Neural Modules 2.3.1 |
| 5 | + |
| 6 | +### Highlights |
| 7 | + |
| 8 | +- Collections |
| 9 | + - LLM |
| 10 | + - Llama 4: Fixed an accuracy issue caused by MoE probability normalization. Improved pre-train and fine-tune performance. |
| 11 | +- Export & Deploy |
| 12 | + - Updated vLLMExporter to use vLLM V1 to address a security vulnerability. |
| 13 | +- AutoModel |
| 14 | + - Improved chat-template handling. |
| 15 | +- Fault Tolerance |
| 16 | + - Local checkpointing: Fixed support for auto-inserted metric names for resuming from local checkpoints. |
| 17 | + |
| 18 | +### Detailed Changelogs: |
| 19 | + |
| 20 | +</details> |
| 21 | + |
| 22 | +#### Export |
| 23 | + |
| 24 | +<details><summary>Changelog</summary> |
| 25 | + |
| 26 | +- Cherry-pick `Update vLLMExporter to use vLLM V1` (#13498) into `r2.3.0` by @chtruong814 :: PR: #13631 |
| 27 | + |
| 28 | +</details> |
| 29 | + |
| 30 | +#### Uncategorized: |
| 31 | + |
| 32 | +<details><summary>Changelog</summary> |
| 33 | + |
| 34 | +- Bump to 2.3.1 by @chtruong814 :: PR: #13507 |
| 35 | +- Cherry pick `Use explicitly cached canary-1b-flash in CI tests (13237)` into `r2.3.0` by @ko3n1g :: PR: #13508 |
| 36 | +- Cherry pick `[automodel] bump liger-kernel to 0.5.8 + fallback (13260)` into `r2.3.0` by @ko3n1g :: PR: #13308 |
| 37 | +- Cherry-pick `Add recipe and ci scripts for qwen2vl` to `r2.3.0` by @romanbrickie :: PR: #13336 |
| 38 | +- Cherry pick `Fix skipme handling (13244)` into `r2.3.0` by @ko3n1g :: PR: #13376 |
| 39 | +- Cherry pick `Allow fp8 param gather when using FSDP (13267)` into `r2.3.0` by @ko3n1g :: PR: #13383 |
| 40 | +- Cherry pick `Handle boolean args for performance scripts and log received config (13291)` into `r2.3.0` by @ko3n1g :: PR: #13416 |
| 41 | +- Cherry pick `new perf configs (13110)` into `r2.3.0` by @ko3n1g :: PR: #13431 |
| 42 | +- Cherry pick `Adding additional unit tests for the deploy module (13411)` into `r2.3.0` by @ko3n1g :: PR: #13449 |
| 43 | +- Cherry pick `Adding more export tests (13410)` into `r2.3.0` by @ko3n1g :: PR: #13450 |
| 44 | +- Cherry pick `[automodel] add FirstRankPerNode (13373)` into `r2.3.0` by @ko3n1g :: PR: #13559 |
| 45 | +- Cherry pick `[automodel] deprecate global_batch_size dataset argument (13137)` into `r2.3.0` by @ko3n1g :: PR: #13560 |
| 46 | +- Cherry-pick `[automodel] fallback FP8 + LCE -> FP8 + CE` (#13349) into `r2.3.0` by @chtruong814 :: PR: #13561 |
| 47 | +- Cherry pick `[automodel] add find_unused_parameters=True for DDP (13366)` into `r2.3.0` by @ko3n1g :: PR: #13601 |
| 48 | +- Cherry pick `Add CI test for local checkpointing (#13012)` into `r2.3.0` by @ananthsub :: PR: #13472 |
| 49 | +- Cherry pick `[automodel] fix --mbs/gbs dtype and chat-template (13598)` into `r2.3.0` by @akoumpa :: PR: #13613 |
| 50 | +- Cherry-pick `Update t5.py` (#13082) to `r2.3.0` and `bump mcore to f98b1a0` by @chtruong814 :: PR: #13642 |
| 51 | +- [Automodel] Fix CP device_mesh issue, use PTL distsampler (#13473) by @akoumpa :: PR: #13636 |
| 52 | +- [Llama4] Fix the recipe bug - cherrypick #13649 by @gdengk :: PR: #13650 |
| 53 | +- build: Pin transformers (#13675) by @ko3n1g :: PR: #13692 |
| 54 | + |
| 55 | +</details> |
| 56 | + |
4 | 57 | ## NVIDIA Neural Modules 2.3.0
|
5 | 58 |
|
6 | 59 | ### Highlights
|
|
0 commit comments