Replies: 2 comments 1 reply
Are you trying to find the best learning rate at each iteration of the training loop? I'm not sure there's a better way to do it than the code example you provided. I wonder whether you could write a custom optax scheduler that takes into account the loss each candidate learning rate produces?
You can probably |
I'm working on some code where I want to see whether a manual line search over the step size helps. At each update I evaluate a small list of candidate step sizes and take the one that gives the best result. This is how I currently have it implemented.
Is there a better way to do this? I thought I could maybe vmap across the step sizes, but that blows up the memory a fair bit.
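One possible direction, as a minimal sketch rather than a definitive answer: evaluate the candidates sequentially with `jax.lax.map` instead of `vmap`, so that only one trial parameter set is live at a time. Everything here is an assumption for illustration: `loss_fn` is a hypothetical least-squares loss, `params` is a single array rather than a pytree, the update is plain gradient descent, and "best" means lowest loss on the same batch.

```python
import jax
import jax.numpy as jnp


def loss_fn(params, x, y):
    # Hypothetical least-squares loss, used only for illustration.
    pred = x @ params
    return jnp.mean((pred - y) ** 2)


@jax.jit
def line_search_step(params, x, y, step_sizes):
    # Compute the gradient once; every candidate step size shares it.
    grads = jax.grad(loss_fn)(params, x, y)

    def trial_loss(step_size):
        # Loss after a plain gradient-descent step with this candidate.
        return loss_fn(params - step_size * grads, x, y)

    # lax.map evaluates the candidates one at a time (it lowers to a scan),
    # so the trial parameters are not batched across candidates as with vmap.
    losses = jax.lax.map(trial_loss, step_sizes)
    best = jnp.argmin(losses)
    best_step = step_sizes[best]
    return params - best_step * grads, best_step, losses[best]


# Toy data (assumed, not from the original post).
key_x, key_w = jax.random.split(jax.random.PRNGKey(0))
x = jax.random.normal(key_x, (32, 4))
true_w = jax.random.normal(key_w, (4,))
y = x @ true_w

params = jnp.zeros(4)
step_sizes = jnp.array([1e-3, 1e-2, 1e-1, 0.5])

initial_loss = loss_fn(params, x, y)
for _ in range(20):
    params, best_step, loss = line_search_step(params, x, y, step_sizes)
```

The trade-off is time for memory: the candidates run one after another, but peak memory stays close to that of a single forward pass per trial. For pytree parameters, the update inside `trial_loss` would need `jax.tree_util.tree_map` in place of the array arithmetic.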