Commit abddc85

ko3n1g, github-actions[bot], and chtruong814 authored
Update changelog for r2.3.1 (#13719) (#13725)
* beep boop: Update changelog
* Add highlights to 2.3.1 changelog
* Re-add changelog comment

---------

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Charlie Truong <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Charlie Truong <[email protected]>

1 parent 7fbf299 commit abddc85

1 file changed: +53 -0 lines changed

CHANGELOG.md

Lines changed: 53 additions & 0 deletions
@@ -1,6 +1,59 @@
# Changelog

<!-- Next changelog -->
## NVIDIA Neural Modules 2.3.1

### Highlights

- Collections
  - LLM
    - Llama 4: Fixed an accuracy issue caused by MoE probability normalization. Improved pre-train and fine-tune performance.
- Export & Deploy
  - Updated vLLMExporter to use vLLM V1 to address a security vulnerability.
- AutoModel
  - Improved chat-template handling.
- Fault Tolerance
  - Local checkpointing: Fixed support for auto-inserted metric names for resuming from local checkpoints.

### Detailed Changelogs:

</details>

#### Export

<details><summary>Changelog</summary>

- Cherry-pick `Update vLLMExporter to use vLLM V1` (#13498) into `r2.3.0` by @chtruong814 :: PR: #13631

</details>

#### Uncategorized:

<details><summary>Changelog</summary>

- Bump to 2.3.1 by @chtruong814 :: PR: #13507
- Cherry pick `Use explicitly cached canary-1b-flash in CI tests (13237)` into `r2.3.0` by @ko3n1g :: PR: #13508
- Cherry pick `[automodel] bump liger-kernel to 0.5.8 + fallback (13260)` into `r2.3.0` by @ko3n1g :: PR: #13308
- Cherry-pick `Add recipe and ci scripts for qwen2vl` to `r2.3.0` by @romanbrickie :: PR: #13336
- Cherry pick `Fix skipme handling (13244)` into `r2.3.0` by @ko3n1g :: PR: #13376
- Cherry pick `Allow fp8 param gather when using FSDP (13267)` into `r2.3.0` by @ko3n1g :: PR: #13383
- Cherry pick `Handle boolean args for performance scripts and log received config (13291)` into `r2.3.0` by @ko3n1g :: PR: #13416
- Cherry pick `new perf configs (13110)` into `r2.3.0` by @ko3n1g :: PR: #13431
- Cherry pick `Adding additional unit tests for the deploy module (13411)` into `r2.3.0` by @ko3n1g :: PR: #13449
- Cherry pick `Adding more export tests (13410)` into `r2.3.0` by @ko3n1g :: PR: #13450
- Cherry pick `[automodel] add FirstRankPerNode (13373)` into `r2.3.0` by @ko3n1g :: PR: #13559
- Cherry pick `[automodel] deprecate global_batch_size dataset argument (13137)` into `r2.3.0` by @ko3n1g :: PR: #13560
- Cherry-pick `[automodel] fallback FP8 + LCE -> FP8 + CE` (#13349) into `r2.3.0` by @chtruong814 :: PR: #13561
- Cherry pick `[automodel] add find_unused_parameters=True for DDP (13366)` into `r2.3.0` by @ko3n1g :: PR: #13601
- Cherry pick `Add CI test for local checkpointing (#13012)` into `r2.3.0` by @ananthsub :: PR: #13472
- Cherry pick `[automodel] fix --mbs/gbs dtype and chat-template (13598)` into `r2.3.0` by @akoumpa :: PR: #13613
- Cherry-pick `Update t5.py` (#13082) to `r2.3.0` and `bump mcore to f98b1a0` by @chtruong814 :: PR: #13642
- [Automodel] Fix CP device_mesh issue, use PTL distsampler (#13473) by @akoumpa :: PR: #13636
- [Llama4] Fix the recipe bug - cherrypick #13649 by @gdengk :: PR: #13650
- build: Pin transformers (#13675) by @ko3n1g :: PR: #13692

</details>

## NVIDIA Neural Modules 2.3.0

### Highlights
