|
4 | 4 |
|
5 | 5 | ## [Unreleased]
|
6 | 6 |
|
| 7 | +## [2.0.0rc2](https://github.com/OpenNMT/OpenNMT-py/tree/2.0.0rc2) (2020-11-10) |
| 8 | + |
| 9 | +### Fixes and improvements |
| 10 | +* Parallelize onmt_build_vocab (422d824) |
| 11 | +* Some fixes to the on-the-fly transforms |
| 12 | +* Some CTranslate2 related updates |
| 13 | +* Some fixes to the docs |
| 14 | + |
| 15 | +## [2.0.0rc1](https://github.com/OpenNMT/OpenNMT-py/tree/2.0.0rc1) (2020-09-25) |
| 16 | + |
| 17 | +This is the first release candidate for OpenNMT-py major upgdate to 2.0.0! |
| 18 | + |
| 19 | +The major idea behind this release is the -- almost -- complete **makeover of the data loading pipeline** . A new 'dynamic' paradigm is introduced, allowing to apply on the fly transforms to the data. |
| 20 | + |
| 21 | +This has a few advantages, amongst which: |
| 22 | + |
| 23 | +* remove or drastically reduce the preprocessing required to train a model; |
| 24 | +* increase and simplify the possibilities of data augmentation and manipulation through on-the fly transforms. |
| 25 | + |
| 26 | +These transforms can be specific **tokenization** methods, **filters**, **noising**, or **any custom transform** users may want to implement. Custom transform implementation is quite straightforward thanks to the existing base class and example implementations. |
| 27 | + |
| 28 | +You can check out how to use this new data loading pipeline in the updated [docs and examples](https://opennmt.net/OpenNMT-py). |
| 29 | + |
| 30 | +All the **readily available transforms** are described [here](https://opennmt.net/OpenNMT-py/FAQ.html#what-are-the-readily-available-on-the-fly-data-transforms). |
| 31 | + |
| 32 | +### Performance |
| 33 | + |
| 34 | +Given sufficient CPU resources according to GPU computing power, most of the transforms should not slow the training down. (Note: for now, one producer process per GPU is spawned -- meaning you would ideally need 2N CPU threads for N GPUs). |
| 35 | + |
| 36 | +### Breaking changes |
| 37 | + |
| 38 | +A few features are dropped, at least for now: |
| 39 | + |
| 40 | +* audio, image and video inputs; |
| 41 | +* source word features. |
| 42 | + |
| 43 | +Some very old checkpoints with previous fields and vocab structure are also incompatible with this new version. |
| 44 | + |
| 45 | +For any user that still need some of these features, the previous codebase will be retained as [`legacy` in a separate branch](https://github.com/OpenNMT/OpenNMT-py/tree/legacy). It will no longer receive extensive development from the core team but PRs may still be accepted. |
| 46 | + |
| 47 | + |
| 48 | +----- |
| 49 | + |
7 | 50 | ## [1.2.0](https://github.com/OpenNMT/OpenNMT-py/tree/1.2.0) (2020-08-17)
|
8 | 51 | ### Fixes and improvements
|
9 | 52 | * Support pytorch 1.6 (e813f4d, eaaae6a)
|
|
0 commit comments