Skip to content

Latest commit

 

History

History
84 lines (68 loc) · 3.45 KB

Data_Augment.md

File metadata and controls

84 lines (68 loc) · 3.45 KB

Data Augmentation

Blog

AutoClean

AutoLabel

NLPCDA

EDA

UDA

  • link: https://github.com/google-research/uda
  • author: Google Research
  • paper: Xie, Q. , Dai, Z. , Hovy, E. , Luong, M. T. , & Le, Q. V. . (2019). Unsupervised data augmentation for consistency training.
  • note: unsupervised aata augmentation code.

NLPAUG

TextAttack

LaserTagger

python-pinyin

TextFlint

  • link: https://github.com/textflint/textflint
  • author: Fudan University NLP Group
  • note: a multilingual robustness evaluation platform for natural language processing, which unifies text transformation, sub-population, adversarial attack, and their combinations to provide a comprehensive robustness analysis.

deduplicate-text-datasets

AugLy

Mutate

Text preprocessing for Natural Language Processing