Updated getting started doc #2698


Merged: 1 commit merged into google:main on Jan 18, 2023

Conversation

@chiamp (Collaborator) commented Dec 8, 2022

Updated getting started doc, as part of content restructuring mentioned in #2627. View the doc here.

Renamed "Getting Started" to "Quick Start"

@review-notebook-app
Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

@chiamp chiamp self-assigned this Dec 8, 2022
@chiamp chiamp marked this pull request as draft December 8, 2022 10:41
@chiamp chiamp requested a review from cgarciae December 8, 2022 10:41
@codecov-commenter commented Dec 8, 2022

Codecov Report

Merging #2698 (4f2381e) into main (fec10eb) will increase coverage by 0.06%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main    #2698      +/-   ##
==========================================
+ Coverage   81.15%   81.22%   +0.06%     
==========================================
  Files          51       53       +2     
  Lines        5493     5636     +143     
==========================================
+ Hits         4458     4578     +120     
- Misses       1035     1058      +23     
Impacted Files Coverage Δ
flax/linen/partitioning.py 79.06% <0.00%> (-3.15%) ⬇️
flax/linen/module.py 92.22% <0.00%> (-0.42%) ⬇️
flax/io.py 84.84% <0.00%> (-0.42%) ⬇️
flax/errors.py 85.58% <0.00%> (-0.13%) ⬇️
flax/core/scope.py 90.13% <0.00%> (ø)
flax/linen/linear.py 97.51% <0.00%> (ø)
flax/linen/summary.py 99.01% <0.00%> (ø)
flax/linen/__init__.py 100.00% <0.00%> (ø)
flax/linen/recurrent.py 100.00% <0.00%> (ø)
flax/linen/activation.py 100.00% <0.00%> (ø)
... and 8 more


Comment on lines 158 to 197
labels_onehot = jax.nn.one_hot(labels, num_classes=10)
return optax.softmax_cross_entropy(logits=logits, labels=labels_onehot).mean()

Suggested change
labels_onehot = jax.nn.one_hot(labels, num_classes=10)
return optax.softmax_cross_entropy(logits=logits, labels=labels_onehot).mean()
return optax.softmax_cross_entropy_with_integer_labels(
    logits=logits, labels=labels).mean()
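
For reference, both forms compute the same value; a minimal sketch verifying the equivalence (the logits and labels below are illustrative):

```
import jax
import jax.numpy as jnp
import optax

logits = jnp.array([[2.0, -1.0, 0.5],
                    [0.1, 1.5, -0.3]])
labels = jnp.array([0, 1])

# One-hot path (the original code):
loss_onehot = optax.softmax_cross_entropy(
    logits=logits, labels=jax.nn.one_hot(labels, num_classes=3)).mean()

# Integer-label path (the suggested change); no explicit one-hot needed:
loss_int = optax.softmax_cross_entropy_with_integer_labels(
    logits=logits, labels=labels).mean()

assert jnp.allclose(loss_onehot, loss_int)
```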

@chiamp chiamp force-pushed the getting_started_doc branch from c55c2da to 4229fa7 Compare December 20, 2022 01:50
This tutorial demonstrates how to construct a simple convolutional neural
Welcome to Flax!

Flax is an open source Python neural network library that's built on top of [JAX](https://github.com/google/jax). This tutorial demonstrates how to construct a simple convolutional neural
network (CNN) using the [Flax](https://flax.readthedocs.io) Linen API and train

Nit: "and train it..."


Nit:

Suggested change
Flax is an open source Python neural network library that's built on top of [JAX](https://github.com/google/jax). This tutorial demonstrates how to construct a simple convolutional neural
Flax is an open source Python neural network library built on top of [JAX](https://github.com/google/jax). This tutorial demonstrates how to construct a simple convolutional neural

Comment on lines 56 to 64
executionInfo:
elapsed: 54
status: ok
timestamp: 1671500846075
user:
displayName: Marcus Chiam
userId: '17531616275590396120'
user_tz: 300
id: a9633134

Can assist with cleaning up the Colab Jupyter metadata. @IvyZX may also have a method.


import numpy as np # Ordinary NumPy
import optax # Optimizers
import tensorflow as tf # Tensorflow to operate on TFDS
@8bitmp3 commented Dec 26, 2022

Nit: "TensorFlow"

Maybe:

Suggested change
import tensorflow as tf # Tensorflow to operate on TFDS
import tensorflow as tf # TensorFlow for certain ops like `tf.data.Dataset`

WDYT?

TF is used for calls like `tf.random.set_seed(0)`, as well as `tf.cast()` and setting dtypes such as `tf.float32`. Plus, we're using the `tf.data.Dataset` API (which is separate from TFDS, AFAIK). TFDS is `tensorflow_datasets`.
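
A rough sketch of how those pieces fit together in the data pipeline (batch and buffer sizes below are illustrative, and the notebook's exact code may differ):

```
import tensorflow as tf              # TensorFlow proper: tf.data, tf.cast, seeding
import tensorflow_datasets as tfds   # TFDS, a separate package

tf.random.set_seed(0)  # makes dataset shuffling reproducible

train_ds = tfds.load('mnist', split='train')
# tf.cast converts the uint8 images to tf.float32 (here also rescaled to [0, 1]):
train_ds = train_ds.map(
    lambda s: {'image': tf.cast(s['image'], tf.float32) / 255.0,
               'label': s['label']})
# tf.data.Dataset (distinct from TFDS) handles shuffling and batching:
train_ds = train_ds.shuffle(1024).batch(32).prefetch(1)
```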


## 2. Define network
## 4. Define network

Create a convolutional neural network with the Linen API by subclassing
[Module](https://flax.readthedocs.io/en/latest/flax.linen.html#core-module-abstraction).

Nit:

In other guides we have started using "Flax Module", since "module"/"Module" is a common word.

Maybe here:

Suggested change
[Module](https://flax.readthedocs.io/en/latest/flax.linen.html#core-module-abstraction).
[Flax Module](https://flax.readthedocs.io/en/latest/flax.linen.html#core-module-abstraction).

And then repeat this, which can help new users.


Our function returns a simple scalar value ready for optimization, so we first take the mean of the vector shaped `[batch]` returned by Optax's loss function.
Create an instance of the Module and use the [`Module.tabulate`](https://flax.readthedocs.io/en/latest/api_reference/flax.linen.html#flax.linen.Module.tabulate) method to visualize a table of the model layers by passing an RNG key and template image input.

As before:

Suggested change
Create an instance of the Module and use the [`Module.tabulate`](https://flax.readthedocs.io/en/latest/api_reference/flax.linen.html#flax.linen.Module.tabulate) method to visualize a table of the model layers by passing an RNG key and template image input.
Create an instance of the Flax Module and use the [`Module.tabulate`](https://flax.readthedocs.io/en/latest/api_reference/flax.linen.html#flax.linen.Module.tabulate) method to visualize a table of the model layers by passing an RNG key and template image input.
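
For readers following along, a minimal sketch of that `tabulate` call (the stand-in CNN below is simplified; the guide's model has more layers):

```
import jax
import jax.numpy as jnp
from flax import linen as nn

class CNN(nn.Module):  # simplified stand-in for the guide's model
  @nn.compact
  def __call__(self, x):
    x = nn.Conv(features=32, kernel_size=(3, 3))(x)
    x = nn.relu(x)
    x = x.reshape((x.shape[0], -1))  # flatten
    return nn.Dense(features=10)(x)

cnn = CNN()
# Pass an RNG key and a template image input (a batch of one 28x28x1 image):
print(cnn.tabulate(jax.random.PRNGKey(0), jnp.ones((1, 28, 28, 1))))
```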

+++ {"id": "lYz0Emry-ele"}

## 5. Loading data
We simply use `optax.softmax_cross_entropy()`. Note that this function expects both `logits` and `labels` to have shape `[batch, num_classes]`. Since the labels will be read from TFDS as integer values, we first need to convert them to a onehot encoding.

Nit:

  • Add a link to the Optax softmax cross entropy API doc and mention Optax since it's an external library.
  • If you can, use "second person" like "you"/"your" (Google Style Guide).

For example:

Suggested change
We simply use `optax.softmax_cross_entropy()`. Note that this function expects both `logits` and `labels` to have shape `[batch, num_classes]`. Since the labels will be read from TFDS as integer values, we first need to convert them to a onehot encoding.
For your loss, use a predefined [`optax.softmax_cross_entropy()`](https://optax.readthedocs.io/en/latest/api.html#optax.softmax_cross_entropy) from the Optax library. Note that this function expects both `logits` and `labels` to have shape `[batch, num_classes]`. Since the labels will be read from TFDS as integer values, first convert them to a one-hot encoding.


+++ {"id": "UMFK51rsAUX4"}
+++ {"id": "4b5ac16e"}

## 6. Create train state

Nit: Since it's a Flax term/class maybe use

Suggested change
## 6. Create train state
## 6. Create a `TrainState`

that serves most basic use cases. Usually one would subclass it to add more data
to be tracked, but in this example we can use it without any modifications.
[`flax.training.train_state.TrainState`](https://flax.readthedocs.io/en/latest/flax.training.html#train-state)
that serves most basic use cases. We can then subclass `TrainState` so that it also contains metrics.

Nit:

Suggested change
that serves most basic use cases. We can then subclass `TrainState` so that it also contains metrics.
that serves most basic use cases. You can then subclass `TrainState` so that it also contains metrics.
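
One possible shape of that subclass, as a sketch (the `Metrics` container and the `Dense` stand-in below are illustrative; the notebook's metrics type may differ):

```
import jax
import jax.numpy as jnp
import optax
from flax import linen as nn, struct
from flax.training import train_state

@struct.dataclass
class Metrics:  # illustrative metrics container
  loss: float
  accuracy: float

class TrainState(train_state.TrainState):
  metrics: Metrics  # extra field tracked alongside params and optimizer state

model = nn.Dense(features=10)  # stand-in for the guide's CNN
params = model.init(jax.random.PRNGKey(0), jnp.ones((1, 784)))['params']
state = TrainState.create(
    apply_fn=model.apply, params=params, tx=optax.sgd(0.01),
    metrics=Metrics(loss=0.0, accuracy=0.0))
```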

user_tz: 300
id: e0102447
---
def create_train_state(module, rng, learning_rate, momentum):
"""Creates initial `TrainState`."""

Nit:

Suggested change
"""Creates initial `TrainState`."""
"""Creates an initial `TrainState`."""

import jax
import jax.numpy as jnp # JAX NumPy

from flax import linen as nn # The Linen API
from flax.training import train_state # Useful dataclass to keep train state
from flax import struct # Flax dataclasses

import numpy as np # Ordinary NumPy
import optax # Optimizers

Suggested change
import optax # Optimizers
import optax # Optax for common losses and optimizers


Define a function that loads and prepares the MNIST dataset and converts the
samples to floating-point numbers.
Our function returns a simple scalar value ready for optimization, so we first take the mean of the vector shaped `[batch]` returned by Optax's loss function.

Nit:

  • If you can, use "second person" like "you"/"your" (Google Style Guide).
Suggested change
Our function returns a simple scalar value ready for optimization, so we first take the mean of the vector shaped `[batch]` returned by Optax's loss function.
Your function returns a simple scalar value ready for optimization, so make sure to first take the mean of the vector shaped `[batch]` returned by Optax's loss function.

+++ {"id": "mHQi20yVCsSf"}
+++ {"id": "80fbb60b"}

## 11. Initialize train state

As mentioned before, maybe:

Suggested change
## 11. Initialize train state
## 11. Initialize the `TrainState`

[data stored](https://flax.readthedocs.io/en/latest/design_notes/linen_design_principles.html#how-are-parameters-represented-and-how-do-we-handle-general-differentiable-algorithms-that-update-stateful-variables)
in a JAX
[pytree](https://jax.readthedocs.io/en/latest/pytrees.html#pytrees-and-jax-functions).
- Set TF random seed to ensure dataset shuffling is reproducible.

Nit:

Suggested change
- Set TF random seed to ensure dataset shuffling is reproducible.
- Set the TF random seed to ensure dataset shuffling (with `tf.data.Dataset.shuffle`) is reproducible.
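
On the point quoted above about parameters being stored in a JAX pytree, a minimal sketch of inspecting one (the `Dense` stand-in is illustrative):

```
import jax
import jax.numpy as jnp
from flax import linen as nn

model = nn.Dense(features=10)  # stand-in model; the guide uses a CNN
params = model.init(jax.random.PRNGKey(0), jnp.ones((1, 784)))['params']
# The parameters are a nested dict (a pytree) of arrays:
print(jax.tree_util.tree_map(jnp.shape, params))
# e.g. {'bias': (10,), 'kernel': (784, 10)}
```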


## 14. Train and evaluate
## 14. Inference on test set

Nit:

Suggested change
## 14. Inference on test set
## 14. Perform inference on the test set


+++ {"id": "oKcRiQ89xQkF"}
+++ {"id": "edb528b6"}

Congrats! You made it to the end of the annotated MNIST example. You can revisit

Nit:

Suggested change
Congrats! You made it to the end of the annotated MNIST example. You can revisit
Congratulations! You made it to the end of the annotated MNIST example. You can revisit

"Congrats" may be considered slang (https://developers.google.com/style/translation#be-inclusive).

@8bitmp3 (Collaborator) left a comment:

Left a few minor suggestions. Feel free to add them/change them or ignore them 👍 Hope this helps!

@@ -30,62 +32,105 @@ If you see any changes between the two feel free to create a
[pull request](https://github.com/google/flax/compare)

This is no longer true as we've heavily modified the notebook. Maybe we should remove this note?

Comment on lines 599 to 603
for test_batch in test_ds.as_numpy_iterator():
  test_state = compute_metrics(state=test_state, batch=test_batch)
  pred = state.apply_fn({'params': state.params}, test_batch['image'])  # model inference
  break  # get only the first batch
pred = pred.argmax(axis=1)  # argmax the logits to get predicted labels

Realistically we might want to create a jitted function for inference, e.g.:

Suggested change
for test_batch in test_ds.as_numpy_iterator():
  test_state = compute_metrics(state=test_state, batch=test_batch)
  pred = state.apply_fn({'params': state.params}, test_batch['image'])  # model inference
  break  # get only the first batch
pred = pred.argmax(axis=1)  # argmax the logits to get predicted labels

@jax.jit
def pred_step(state, batch):
  logits = state.apply_fn({'params': state.params}, batch['image'])
  return logits.argmax(axis=1)

test_batch = test_ds.as_numpy_iterator().next()
pred = pred_step(state, test_batch)


Even if we don't create `pred_step`, it's better to use:

test_batch = test_ds.as_numpy_iterator().next()

instead of `break` in the loop.

Comment on lines 618 to 654
def show_img(img, ax=None, title=None):
  """Shows a single image."""
  if ax is None:
    ax = plt.gca()
  ax.imshow(img[..., 0], cmap='gray')
  ax.set_xticks([])
  ax.set_yticks([])
  if title:
    ax.set_title(title)

def show_img_grid(imgs, titles):
  """Shows a grid of images."""
  n = int(np.ceil(len(imgs)**.5))
  _, axs = plt.subplots(n, n, figsize=(3 * n, 3 * n))
  for i, (img, title) in enumerate(zip(imgs, titles)):
    show_img(img, axs[i // n][i % n], title)

for epoch in range(1, num_epochs + 1):
  # Use a separate PRNG key to permute image data during shuffling
  rng, input_rng = jax.random.split(rng)
  # Run an optimization step over a training batch
  state = train_epoch(state, train_ds, batch_size, epoch, input_rng)
  # Evaluate on the test set after each training epoch
  test_loss, test_accuracy = eval_model(state.params, test_ds)
  print(' test epoch: %d, loss: %.2f, accuracy: %.2f' % (
      epoch, test_loss, test_accuracy * 100))

show_img_grid(
    [test_batch['image'][idx] for idx in range(25)],
    [f'label={pred[idx]}' for idx in range(25)],
)

All of this can be reduced to:

fig, axs = plt.subplots(5, 5, figsize=(12, 12))
for i, ax in enumerate(axs.flatten()):
    ax.imshow(test_batch['image'][i, ..., 0], cmap='gray')
    ax.set_title(f"label={pred[i]}")
    ax.axis('off')

@chiamp chiamp force-pushed the getting_started_doc branch from 4229fa7 to 4f2381e Compare January 6, 2023 21:32
@chiamp (Collaborator, Author) commented Jan 6, 2023

Thanks for the suggestions @cgarciae @8bitmp3! I've made the suggested updates.

@chiamp chiamp marked this pull request as ready for review January 11, 2023 21:19
@chiamp chiamp force-pushed the getting_started_doc branch 3 times, most recently from 90c8bde to bbd9379 Compare January 12, 2023 01:19
@cgarciae (Collaborator) left a comment:

Awesome @chiamp, looks very good! Approved.


[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/google/flax/blob/main/docs/getting_started.ipynb)
[![Open On GitHub](https://img.shields.io/badge/Open-on%20GitHub-blue?logo=GitHub)](https://github.com/google/flax/blob/main/docs/getting_started.ipynb)

# Getting Started
# Quick Start

Nit: Both "quickstart" and "quick start" seem to be OK. To be consistent with "JAX Quickstart", would it make sense to name our doc "Quickstart" (one word) or "Flax quickstart"?

## 2. Loading data

Flax can use any
data-loading pipeline and this example demonstrates how to utilize TFDS. Define a function that loads and prepares the MNIST dataset and converts the

Nit:

Suggested change
data-loading pipeline and this example demonstrates how to utilize TFDS. Define a function that loads and prepares the MNIST dataset and converts the
data-loading pipeline and this example demonstrates how to utilize TensorFlow Datasets (TFDS). Define a function that loads and prepares the MNIST dataset and converts the

Since we haven't mentioned TFDS before, it may help to spell out the full name of the library

@chiamp chiamp force-pushed the getting_started_doc branch from bbd9379 to 06c7160 Compare January 17, 2023 23:37
@copybara-service copybara-service bot merged commit 71772f6 into google:main Jan 18, 2023
@chiamp chiamp deleted the getting_started_doc branch January 18, 2023 04:30