Generalize T5 modules #5166

Merged: 22 commits merged into main from t5-generalize on Jun 2, 2021

Conversation

@AkshitaB (Contributor) commented Apr 29, 2021

Changes proposed in this pull request:

  • Adding AttentionModule (naming it so instead of just Attention to differentiate it from allennlp.modules.attention.Attention)
  • SelfAttention and T5Attention inherit from this (see the sketch below).

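For context, here is a minimal, hypothetical sketch of the hierarchy this describes: a shared AttentionModule base class that SelfAttention and T5Attention specialize. The constructor arguments, the `scaled` flag, and the simplified relative-position bucketing are illustrative assumptions, not the actual AllenNLP API.

```python
import torch
import torch.nn as nn


class AttentionModule(nn.Module):
    """Generic multi-head attention; subclasses tweak scoring and bias behavior."""

    def __init__(self, hidden_size: int = 512, num_heads: int = 8, scaled: bool = True):
        super().__init__()
        assert hidden_size % num_heads == 0
        self.num_heads = num_heads
        self.head_dim = hidden_size // num_heads
        self.scaled = scaled
        self.query = nn.Linear(hidden_size, hidden_size, bias=False)
        self.key = nn.Linear(hidden_size, hidden_size, bias=False)
        self.value = nn.Linear(hidden_size, hidden_size, bias=False)
        self.output = nn.Linear(hidden_size, hidden_size, bias=False)

    def _split_heads(self, x: torch.Tensor) -> torch.Tensor:
        # (batch, seq, hidden) -> (batch, heads, seq, head_dim)
        batch, seq, _ = x.shape
        return x.view(batch, seq, self.num_heads, self.head_dim).transpose(1, 2)

    def position_bias(self, scores: torch.Tensor) -> torch.Tensor:
        # The base class adds no positional bias; T5Attention overrides this.
        return torch.zeros_like(scores)

    def forward(self, query, key=None, value=None):
        key = query if key is None else key
        value = key if value is None else value
        q = self._split_heads(self.query(query))
        k = self._split_heads(self.key(key))
        v = self._split_heads(self.value(value))
        scores = torch.matmul(q, k.transpose(-1, -2))
        if self.scaled:
            scores = scores / (self.head_dim ** 0.5)
        weights = (scores + self.position_bias(scores)).softmax(dim=-1)
        context = torch.matmul(weights, v).transpose(1, 2)
        batch, seq, _, _ = context.shape
        return self.output(context.reshape(batch, seq, -1))


class SelfAttention(AttentionModule):
    """BERT-style self-attention: scaled dot product, no relative position bias."""

    def __init__(self, hidden_size: int = 512, num_heads: int = 8):
        super().__init__(hidden_size, num_heads, scaled=True)


class T5Attention(AttentionModule):
    """T5-style attention: unscaled scores plus a learned relative position bias."""

    def __init__(self, hidden_size: int = 512, num_heads: int = 8, num_buckets: int = 32):
        super().__init__(hidden_size, num_heads, scaled=False)
        self.num_buckets = num_buckets
        self.relative_bias = nn.Embedding(num_buckets, num_heads)

    def position_bias(self, scores: torch.Tensor) -> torch.Tensor:
        # Crude stand-in for T5's log-spaced bucketing: clip the relative distance.
        seq_q, seq_k = scores.shape[-2], scores.shape[-1]
        rel = (torch.arange(seq_k, device=scores.device)[None, :]
               - torch.arange(seq_q, device=scores.device)[:, None])
        buckets = rel.clamp(-self.num_buckets // 2, self.num_buckets // 2 - 1) + self.num_buckets // 2
        bias = self.relative_bias(buckets)           # (seq_q, seq_k, heads)
        return bias.permute(2, 0, 1).unsqueeze(0)    # (1, heads, seq_q, seq_k)
```

Usage is the same for both subclasses, e.g. `T5Attention(hidden_size=64, num_heads=4)(torch.randn(2, 10, 64))` returns a `(2, 10, 64)` tensor.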
@AkshitaB marked this pull request as draft April 29, 2021 08:10
@AkshitaB marked this pull request as ready for review May 6, 2021 16:46
@AkshitaB requested review from dirkgr and epwalsh May 6, 2021 16:46
@epwalsh (Member) left a comment

Just a couple questions/comments about organization so far:

  • Could "general attention" just be called "attention"?
  • Could we put the T5Attention in the t5 module?
  • Can we get rid of T5AttentionOld?

@AkshitaB (Contributor, Author) commented May 6, 2021

Just a couple questions/comments about organization so far:

  • Could "general attention" just be called "attention"?
  • Could we put the T5Attention in the t5 module?
  • Can we get rid of T5AttentionOld?

Yes to all three. I'm still debugging an issue with loading pretrained weights and will clean it up in the next commit. This draft was meant to get feedback on the GeneralAttention module itself: does it make sense? Is the logical flow clear enough in terms of readability?

@AkshitaB requested a review from epwalsh May 25, 2021 19:45
@epwalsh (Member) left a comment

I don't have much to say in addition to Dirk's comments. As long as tests pass this LGTM!

@AkshitaB requested a review from epwalsh June 2, 2021 00:17
@epwalsh (Member) left a comment

Just a couple of very small suggestions/questions.

We can probably delete allennlp.modules.transformer.self_attention, right?

@epwalsh (Member) left a comment

💯

@AkshitaB merged commit b0aa1d4 into main Jun 2, 2021
@AkshitaB deleted the t5-generalize branch June 2, 2021 21:24
Abhishek-P pushed a commit to Abhishek-P/allennlp that referenced this pull request Aug 11, 2021
* initial commit

* general self attn

* fixing bugs, adding tests, adding docs

* updating other modules

* refactor

* bug fix

* update changelog

* fix shape

* fix format

* address feedback

* small doc fix

* Update allennlp/modules/transformer/transformer_stack.py

Co-authored-by: Pete <[email protected]>

* remove old file

Co-authored-by: epwalsh <[email protected]>
Co-authored-by: Pete <[email protected]>