Transformers from scratch with JAX/Flax #2905
Unanswered
kenkenpa2126
asked this question in
Show and tell
Replies: 3 comments 1 reply
-
It looks like a good candidate for this section here: https://flax.readthedocs.io/en/latest/examples_community_examples.html#examples |
Beta Was this translation helpful? Give feedback.
0 replies
-
Also, I believe @levskaya and @IvyZX were working on a Flax implementation of a transformer? |
Beta Was this translation helpful? Give feedback.
1 reply
-
Thanks for your comments! I'll make a PR after I write some explanation. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I'm Kenta, a student in Japan.
I started a tutorial to understand Transformer models through implementation with JAX/Flax.
As the first step, even though I haven't finished writing detailed explanations yet, I released the notebook of Vanilla Transformer.
https://github.com/kenkenpa2126/Transformers_from_scratch_with_JAX-Flax
I aim to register it to the community example as a tutorial and wonder how to apply it.
Is it OK to list it on this page directly and make a PR?
https://github.com/google/flax/blob/main/docs/examples_community_examples.rst
And also, If you gave me some advice or comments about the tutorial, I would appreciate it very much!
Beta Was this translation helpful? Give feedback.
All reactions