README.md: 3 additions & 3 deletions
@@ -172,7 +172,7 @@ A multi-platform and open-source software able to create synthetic image documen
## Natural Language Processing
-#### - [Contextual data augmentation](https://github.com/pfnet-research/contextual_augmentation) - Contextual augmentation is a domain-independent data augmentation for text classification tasks. Texts in supervised dataset are augmented by replacing words with other words which are predicted by a label-conditioned bi-directional language model.
+#### - [Contextual data augmentation](https://github.com/pfnet-research/contextual_augmentation)- Contextual augmentation is a domain-independent data augmentation for text classification tasks. Texts in supervised dataset are augmented by replacing words with other words which are predicted by a label-conditioned bi-directional language model.
This repository contains a collection of scripts for an experiment of [Contextual Augmentation](https://arxiv.org/pdf/1805.06201.pdf).

@@ -196,9 +196,9 @@ Features:
-**Random Swap (RS):** Randomly choose two words in the sentence and swap their positions. Do this *n* times.
-**Random Deletion (RD):** For each word in the sentence, randomly remove it with probability *p*.
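
The two operations described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the referenced project's implementation: the function names `random_swap` and `random_deletion`, the whitespace tokenization, and the fallback that keeps one word when every word would be deleted are all assumptions made for this example.

```python
import random


def random_swap(words, n, rng=random):
    """Random Swap (RS): swap two randomly chosen positions, n times."""
    words = list(words)  # work on a copy so the caller's list is untouched
    if len(words) < 2:
        return words  # nothing to swap
    for _ in range(n):
        i, j = rng.sample(range(len(words)), 2)  # two distinct indices
        words[i], words[j] = words[j], words[i]
    return words


def random_deletion(words, p, rng=random):
    """Random Deletion (RD): drop each word independently with probability p."""
    words = list(words)
    if not words:
        return words
    kept = [w for w in words if rng.random() >= p]
    # Assumption for this sketch: never return an empty sentence.
    return kept if kept else [rng.choice(words)]


if __name__ == "__main__":
    rng = random.Random(0)  # seeded only to make the demo repeatable
    sentence = "the quick brown fox jumps over the lazy dog".split()
    print(" ".join(random_swap(sentence, n=2, rng=rng)))
    print(" ".join(random_deletion(sentence, p=0.2, rng=rng)))
```

Passing a seeded `random.Random` instance keeps augmentation runs reproducible; omitting `rng` falls back to the module-level generator.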
-#### - [Wiki Edits](https://github.com/snukky/wikiedits) - A collection of scripts for automatic extraction of edited sentences from text edition histories, such as Wikipedia revisions. It was used to create the WikEd Error Corpus --- a corpus of corrective Wikipedia edits. The corpus has been prepared for two languages: Polish and English. Can be used as a dictionary-based augmentatioon to insert user-induced errors.
+#### - [Wiki Edits](https://github.com/snukky/wikiedits) - A collection of scripts for automatic extraction of edited sentences from text edition histories, such as Wikipedia revisions. It was used to create the WikEd Error Corpus --- a corpus of corrective Wikipedia edits. The corpus has been prepared for two languages: Polish and English. Can be used as a dictionary-based augmentatioon to insert user-induced errors.
-#### - [TextAttack 🐙](https://github.com/QData/TextAttack) - TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.
+#### - [TextAttack 🐙](https://github.com/QData/TextAttack) - TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.
Many of the components of TextAttack are useful for data augmentation. The `textattack.Augmenter` class
uses a transformation and a list of constraints to augment data. We also offer five built-in recipes
for data augmentation [source:QData/TextAttack](https://github.com/QData/TextAttack):