Skip to content

Commit 032e487

Browse files
Yevgeni Litvinjsgoller1
authored andcommitted
More info into 0.6.0 release notes.
1 parent b55a6c2 commit 032e487

File tree

1 file changed

+13
-3
lines changed

1 file changed

+13
-3
lines changed

docs/release-notes.rst

Lines changed: 13 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,8 @@ Release notes
77
Release 0.6.0
88
=============
99

10+
Thanks to our new contributers: Kim Hammar and Joshua Goller!
11+
1012
Breaking changes
1113
----------------
1214
- ``petastorm.etl.dataset_metadata.materialize_dataset()`` should be passed a filesystem factory method
@@ -15,12 +17,20 @@ Breaking changes
1517

1618
New features and bug fixes
1719
--------------------------
20+
- Added functionality for transform-on-worker thread/pool. The transform enables PyTorch users to run preprocessing
21+
code on worker processes/threads. It enables Tensorflow users to parallelize Python preprocessing code on
22+
a process pool, as part of the training/evaluation graph. Users now specify a ``transform_spec`` when calling
23+
``make_reader()`` or ``make_batch_reader()``.
24+
- Added ``hdfs_driver`` argument to the following functions: ``get_schema_from_dataset_url``, ``FilesystemResolver``,
25+
``generate_petastorm_metadata``, ``build_rowgroup_index``, ``RowGroupLoader``, ``dataset_as_rdd`` and ``copy_dataset``
1826
- the Docker container in ``/docker`` has been made into a workspace container aimed to support development on MacOS.
1927
- New `hello_world` examples added for using non-Petastorm datasets.
20-
- Added functionality for transform-on-worker thread/pool. Users now specify a ``transform_spec`` when calling ``make_reader()``
21-
or ``make_batch_reader()``
22-
- Fixed a bug that caused all columns of a dataset to be read when ``schema_fields=NGram(...)`` was used.
2328
- Allow for unicode strings to be passed as regex filters in Unischema when selecting which columns to read.
29+
- Fixed a bug that caused all columns of a dataset to be read when ``schema_fields=NGram(...)`` was used.
30+
- Fixed type of an argument passed to a predicate when the predicate is defined on a numeric partition field
31+
- Support regular unicode strings as expressions as a value of make_reader's schema_fields argument.
32+
- Emit a warning when opening a Petastorm-created dataset using make_batch_reader (``make_batch_reader`` currently
33+
does not support Petastorm specific types, such as tensors).
2434

2535
Release 0.5.1
2636
=============

0 commit comments

Comments
 (0)