NVIDIA Megatron-LM Discussions
- [QUESTION] How to set --rotary-seq-len-interpolation-factor for rope scaling? (stale: No activity in 60 days on issue or PR)
- [QUESTION] How to re-initialize process group after destroy_process_group()? (stale: No activity in 60 days on issue or PR)
- How to split the dataset when running pretrain_bert.py (stale: No activity in 60 days on issue or PR)
- [QUESTION] Why write a special LinearWithFrozenWeight? (stale: No activity in 60 days on issue or PR)
- Question about test_global_memory_buffer (stale: No activity in 60 days on issue or PR)