Skip to content
This repository was archived by the owner on Mar 19, 2024. It is now read-only.

Commit 94598ba

Browse files
prigoyalfacebook-github-bot
authored andcommitted
Explore ShardedDDP reduce_buffer_size setting to config (#177)
Summary: Pull Request resolved: #177 as title, in VISSL, we need to set the `reduce_buffer_size=0` as there are parameters that are not actually being used and `find_used_parameters` is something not handled by shardedDPP. setting buffer size to 0 will all reduce the gradients immediately instead of bucketing them Reviewed By: min-xu-ai Differential Revision: D26276800 fbshipit-source-id: 4bbe5a6e3a2b36b8a55abb6e120368025356db17
1 parent 9b2349d commit 94598ba

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

vissl/config/defaults.yaml

+3
Original file line numberDiff line numberDiff line change
@@ -258,6 +258,9 @@ config:
258258
# how many times the model should be checkpointed. User should tune this parameter
259259
# and find the number that offers best memory saving and compute tradeoff.
260260
NUM_ACTIVATION_CHECKPOINTING_SPLITS: 2
261+
# setup for Fairscale sharded DDP
262+
SHARDED_DDP_SETUP:
263+
reduce_buffer_size: -1
261264
# ----------------------------------------------------------------------------------- #
262265
# Feature evaluation settings
263266
# ----------------------------------------------------------------------------------- #

0 commit comments

Comments
 (0)