Open
Description
Do you have any plans to apply the recently published Reinforced Self-Training (ReST)?
Paper: Reinforced Self-Training (ReST) for Language Modeling
https://arxiv.org/abs/2308.08998