Pull requests: aws-samples/awsome-distributed-training
Pull requests list
Improvements/fsdp restructure
enhancement
New feature or request
refactoring
#630
opened Apr 3, 2025 by
mhuguesaws
•
Draft
Change PyTorch version for FSDP case and remove conda
#629
opened Apr 3, 2025 by
mhuguesaws
Add automated Grafana dashboard deployment
enhancement
New feature or request
#607
opened Mar 25, 2025 by
mhuguesaws
Terraform Modules for HyperPod EKS
enhancement
New feature or request
#586
opened Mar 14, 2025 by
bluecrayon52
Update megatron lm testcase
enhancement
New feature or request
#537
opened Jan 30, 2025 by
KeitaW
Update bionemo test case + propose to subdirectories per orchestrator
documentation
Improvements or additions to documentation
Update SMPv2 conda setup script with latest PT2.3.1 TSM2.4.0
#366
opened Jun 25, 2024 by
viclzhu
End-to-End LLM Model Development with Torchtitan and Torchtune
enhancement
New feature or request
#341
opened May 20, 2024 by
KeitaW