Add predict nonlinearity #662

Merged: 17 commits merged into master on Jul 28, 2020
Conversation

@BenjaminBossan (Collaborator) commented Jun 27, 2020

Resolves #637, resolves #661

Supersedes #572 and #580

Added a parameter predict_nonlinearity to NeuralNet which allows users to control the nonlinearity to be applied to the module output when calling predict and predict_proba. Also, when using CrossEntropyLoss, softmax is now automatically applied to the output.

This PR implements what was discussed in #661. Regarding the implementation, I used the simplest approach for now (no dispatching based on the net class, for instance). Also, I added a get_predict_nonlinearity method to NeuralNet, which handles the case of 'auto' vs None vs callable.

Regarding the naming, I decided against output_nonlinearity. I believe predict_nonlinearity makes it much more obvious that this nonlinearity is not applied to the module output directly (which would affect loss etc.) but only when calling predict and predict_proba.
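To make the behavior concrete, the 'auto' vs None vs callable resolution can be sketched in plain Python. This is an illustrative mock-up (the function names and the list-based softmax are mine, not skorch's tensor-based internals):

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def resolve_predict_nonlinearity(predict_nonlinearity, criterion_name):
    """Sketch of resolving 'auto' vs None vs callable.

    - None: apply no nonlinearity (identity).
    - 'auto': choose based on the criterion, e.g. softmax for
      CrossEntropyLoss, identity otherwise.
    - Any other callable: use as-is.
    """
    if predict_nonlinearity is None:
        return lambda x: x
    if predict_nonlinearity == 'auto':
        if criterion_name == 'CrossEntropyLoss':
            return softmax
        return lambda x: x
    return predict_nonlinearity  # user-supplied callable

# usage: with 'auto' and CrossEntropyLoss, logits become probabilities
fn = resolve_predict_nonlinearity('auto', 'CrossEntropyLoss')
probs = fn([0.0, 1.0, 2.0])
```

With predict_nonlinearity=None the raw module output passes through unchanged, which matters when the user's module already ends in a softmax or sigmoid.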

ping @qtux

BenjaminBossan added 2 commits June 27, 2020 19:08
Fails in PyTorch 1.1

There was one more instance in a unit test that needed fixing.
@thomasjpfan (Member) left a comment:

Thank you for working on this @BenjaminBossan !

skorch/net.py Outdated
@@ -1010,6 +1039,46 @@ def infer(self, x, **fit_params):
return self.module_(**x_dict)
return self.module_(x, **fit_params)

def get_predict_nonlinearity(self):
Member:

With this being public there will be two ways to adjust the nonlinearity:

  1. With the predict_nonlinearity parameter in init.
  2. Subclassing

Are we okay with this?

Collaborator (author):

Good question, I was also unsure about this. The predict_nonlinearity parameter specifies what nonlinearity I want, whereas get_predict_nonlinearity determines how that choice is resolved into a callable. I would also be okay with making the latter private (with the possibility of making it public later if the need ever arises).

Member:

I would go with option 1 for now and having this function be public.

If a user were to override _get_predict_nonlinearity, we would not officially support it and it may break in the future.

Collaborator (author):

Just to be sure, option 1 for you means making get_predict_nonlinearity private?
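For concreteness, the two routes discussed above (passing the nonlinearity via __init__ vs subclassing and overriding the resolution method) can be sketched with a stand-in class. The class and method names here mirror the discussion but are a hypothetical mock-up, not skorch's actual API:

```python
class Net:
    """Stand-in for a net with a configurable predict nonlinearity."""

    def __init__(self, predict_nonlinearity=None):
        self.predict_nonlinearity = predict_nonlinearity

    def get_predict_nonlinearity(self):
        # resolve the parameter into a callable; identity if None
        if self.predict_nonlinearity is None:
            return lambda x: x
        return self.predict_nonlinearity

# way 1: pass a callable through the __init__ parameter
net1 = Net(predict_nonlinearity=lambda x: [abs(v) for v in x])

# way 2: subclass and override the resolution method
class MyNet(Net):
    def get_predict_nonlinearity(self):
        return lambda x: [v * 2 for v in x]

net2 = MyNet()
```

Keeping the method public means both routes are available; the discussion above settles on officially supporting route 1.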

@thomasjpfan (Member) left a comment:

Otherwise LGTM


@ottonemo (Member) left a comment:

LGTM except comments

@@ -289,7 +289,12 @@ def test_custom_loss_does_not_call_sigmoid(
mock = Mock(side_effect=lambda x: x)
monkeypatch.setattr(torch, "sigmoid", mock)

net = net_cls(module_cls, max_epochs=1, lr=0.1, criterion=nn.MSELoss)
# we need to add a custom nonlinearity, otherwise the output won't be 2d
Member:

This comment does not help me understand, since it does not explain why the output won't be 2d.

Collaborator (author):

Fixed

# don't want callbacks to trigger side effects
net.callbacks_ = []
net.partial_fit(X, y)
assert not side_effect
Member:

But wouldn't we expect callbacks such as accuracy scoring to call predict?

Collaborator (author):

That's why I removed all callbacks two lines earlier, otherwise the test becomes very messy.

def test_predict_nonlinearity_none(
self, net_cls, module_cls, data):
# even though we have CrossEntropyLoss, we don't want the
# output from predict_proba to be modified, since we set
Member:

Suggested change:
- # output from predict_proba to be modified, since we set
+ # output from predict_proba to be modified, thus we set

Collaborator (author):

Fixed


def test_infer_neural_binary_net_classifier_default(
self, infer_predict_nonlinearity, net_bin_clf_cls, module_cls):
# BCEWithLogitsLoss should return valid probabilities
Member:

Suggested change:
- # BCEWithLogitsLoss should return valid probabilities
+ # BCEWithLogitsLoss criterion: nonlinearity should return valid probabilities

Collaborator (author):

Fixed

skorch/utils.py Outdated

Sigmoid is applied to x to transform it to probabilities. Then
concatenate the probabilities with 1 - these probabilities to
return a correctly formed ``y_proba``.
Member:

Maybe specify who expects this format, something along the lines of "formats outputs for use with BCE loss".
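The docstring under review describes the binary case: a 1-d output of logits is passed through a sigmoid, and the complementary probabilities are stacked alongside to form a two-column ``y_proba``. A minimal sketch of that idea in plain Python (skorch's actual implementation operates on torch tensors; the names here are illustrative):

```python
import math

def sigmoid(v):
    """Map a logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))

def sigmoid_then_2d(logits):
    """Turn a 1-d list of logits into rows [P(class 0), P(class 1)].

    Sigmoid gives P(class 1) for each logit; stacking 1 - p next to p
    yields a correctly shaped y_proba whose rows sum to 1, as expected
    for outputs trained with a BCE-style loss.
    """
    probs = [sigmoid(v) for v in logits]
    return [[1.0 - p, p] for p in probs]

y_proba = sigmoid_then_2d([-2.0, 0.0, 2.0])
```

Each row then behaves like the output of a two-class predict_proba, so downstream code (e.g. argmax for predict) works unchanged.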

BenjaminBossan and others added 2 commits July 6, 2020 23:22
Improve explanations in comments and docstrings.
@BenjaminBossan (Collaborator, author) commented:

@ottonemo I addressed your comments.

@ottonemo ottonemo merged commit 5dedb07 into master Jul 28, 2020
@BenjaminBossan BenjaminBossan deleted the feature/predict-nonlinearity branch July 30, 2020 22:24