Implementing abstraction to score final sequences in `BeamSearch` #5208

danieldeutsch · 2021-05-17T19:44:13Z

Implements feature requests 2 and 3 from #5205 and closes #5113.

Changes proposed in this pull request:

Other libraries, such as transformers or fairseq pick the top outputs from beam search by taking the sequence with the highest average log probability per token. See here for the transformers implementation. The AllenNLP implementation takes the sequence with the best total log probability of the sequence. This change adds an abstraction called FinalSequenceScorer to decide how the final sequences will be scored. The default is SequenceLogProbabilityScorer, which assigns a score equal to sum of the log probabilities per token (i.e., the current implementation). This PR also includes LengthNormalizedSequenceLogProbabilityScorer which assigns a score equal to the average log probability per token.
The LengthNormalizedSequenceLogProbabilityScorer also has a length_penalty parameter, which will increase or decrease sequence scores based on their length. This is also included in the transformers beam search.

Notes

This change does not break the code, but it does change the semantics of the second tensor returned by the search method in BeamSearch. Previously it was the log probabilities of the sequences, but now it is the score that is returned by the FinalSequenceScorer class. The new default is to return what is currently returned, so this should not break any models.

Before submitting

I've read and followed all steps in the Making a pull request
section of the CONTRIBUTING docs.
I've updated or added any relevant docstrings following the syntax described in the
Writing docstrings section of the CONTRIBUTING docs.
If this PR fixes a bug, I've added a test that will fail without my fix.
If this PR adds a new feature, I've added tests that sufficiently cover my new functionality.

After submitting

All GitHub Actions jobs for my pull request have passed.
codecov/patch reports high test coverage (at least 90%).
You can find this under the "Actions" tab of the pull request once the other checks have finished.

epwalsh

This also looks great! I appreciate the thorough documentation 🙂 Just a few minor comments

allennlp/nn/beam_search.py

Co-authored-by: Pete <[email protected]>

epwalsh

Looks great! Thank you @danieldeutsch!

…lenai#5208) * Implementing FinalSequenceScorer in BeamSearch * Including the end token in the normalization * Reformating * Apply suggestions from code review Co-authored-by: Pete <[email protected]> * Sorting the sequences by the final scores Co-authored-by: Pete <[email protected]> Co-authored-by: Pete <[email protected]>

danieldeutsch and others added 4 commits May 17, 2021 15:02

Implementing FinalSequenceScorer in BeamSearch

6858f8e

Including the end token in the normalization

36e8759

Reformating

789020d

Merge branch 'main' into sequence-scoring

639f4a6

epwalsh reviewed May 17, 2021

View reviewed changes

allennlp/nn/beam_search.py Show resolved Hide resolved

allennlp/nn/beam_search.py Outdated Show resolved Hide resolved

allennlp/nn/beam_search.py Outdated Show resolved Hide resolved

allennlp/nn/beam_search.py Outdated Show resolved Hide resolved

epwalsh enabled auto-merge (squash) May 17, 2021 21:30

Apply suggestions from code review

3483686

Co-authored-by: Pete <[email protected]>

auto-merge was automatically disabled May 18, 2021 01:21
Head branch was pushed to by a user without write access

danieldeutsch added 2 commits May 17, 2021 22:04

Sorting the sequences by the final scores

082fc36

Merging main into branch

600931b

epwalsh approved these changes May 18, 2021

View reviewed changes

epwalsh merged commit 3585c9f into allenai:main May 18, 2021

danieldeutsch deleted the sequence-scoring branch May 18, 2021 17:00

danieldeutsch mentioned this pull request May 20, 2021

Implement a ROUGE metric that faithfully reproduces the official metric written in perl. #5153

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing abstraction to score final sequences in `BeamSearch` #5208

Implementing abstraction to score final sequences in `BeamSearch` #5208

danieldeutsch commented May 17, 2021

epwalsh left a comment •

edited

Loading

epwalsh left a comment

Implementing abstraction to score final sequences in BeamSearch #5208

Implementing abstraction to score final sequences in BeamSearch #5208

Conversation

danieldeutsch commented May 17, 2021

Notes

Before submitting

After submitting

epwalsh left a comment • edited Loading

Choose a reason for hiding this comment

epwalsh left a comment

Choose a reason for hiding this comment

Implementing abstraction to score final sequences in `BeamSearch` #5208

Implementing abstraction to score final sequences in `BeamSearch` #5208

epwalsh left a comment •

edited

Loading