
categorical accuracy doesn't account for mask (w/ proposed solution) #2260


Closed
braingineer opened this issue Apr 11, 2016 · 5 comments


@braingineer
Contributor

Hi all,

Currently, accuracy is:

def categorical_accuracy(y_true, y_pred):
    return K.mean(K.equal(K.argmax(y_true, axis=-1),
                          K.argmax(y_pred, axis=-1)))

I am running an RNN on variable-length sentences and want the accuracy of predictions at every time step. So basically, alongside the 3-dim prediction tensor I have a binary mask matrix, and I want each timestep of each batch entry to count toward the accuracy mean only if its mask entry is 1.

Proposed solution MWE (this is in Theano rather than the abstracted Keras backend, but the change is trivial):

import numpy as np
import theano.tensor as T
from functools import reduce  # built in on Python 2; import needed on Python 3

### (batch size=2, seq len=3, output classes=5)
pred = np.arange(30, dtype=np.float32).reshape(2,3,5)
pred /= pred.sum(axis=-1, keepdims=True)

### target matrix to match pred
target1 = np.ones_like(pred)
target1[:,:,:-1] = 0

### target matrix that is wrong only at the masked timestep
target2 = np.ones_like(pred)
target2[:,:,:-1] = 0
target2[0,-1,-1] = 0

### mask:
# [ [1, 1, 0],
#   [1, 1, 1] ]
mask = np.ones(pred.shape[:-1])
mask[0,-1] = 0

# preserve last dimension 
flat_shape = lambda mat: (reduce(lambda x,y: x*y, mat.shape[:-1]), mat.shape[-1])
F_flat = lambda mat: T.reshape(mat, flat_shape(mat))

# compare two matrices
F_comp = lambda mat1, mat2: T.eq(T.argmax(mat1,axis=-1), T.argmax(mat2,axis=-1))

# mask the matrix
F_mask = lambda mat, mask: mat[mask.flatten().nonzero()]

# get the mean
F_mean = lambda mat: T.mean(mat)

## current method for categorical accuracy
F_score1 = lambda mat1, mat2: F_mean(F_comp(mat1, mat2))

## masked method
F_score2 = lambda mat1, mat2, mask: F_mean(F_mask(F_comp(F_flat(mat1), F_flat(mat2)), mask))

F_score1(pred, target1).eval() ## returns 1.0
F_score1(pred, target2).eval() ## returns 0.83
F_score2(pred, target1, mask).eval() ## returns 1.0
F_score2(pred, target2, mask).eval() ## returns 1.0

edit: currently using this

from functools import reduce  # built in on Python 2
from operator import mul

def categorical_accuracy(y_true, y_pred, mask=None):
    if mask is not None:
        eval_shape = (reduce(mul, y_true.shape[:-1]), y_true.shape[-1])
        y_true_ = K.reshape(y_true, eval_shape)
        y_pred_ = K.reshape(y_pred, eval_shape)
        flat_mask = K.flatten(mask)
        comped = K.equal(K.argmax(y_true_, axis=-1),
                         K.argmax(y_pred_, axis=-1))
        ## not sure how to do this in TensorFlow
        good_entries = flat_mask.nonzero()[0]
        return K.mean(K.gather(comped, good_entries))

    else:
        return K.mean(K.equal(K.argmax(y_true, axis=-1),
                              K.argmax(y_pred, axis=-1)))

and I've shot from loss: 3.9160 - acc: 0.1424 to loss: 3.9165 - acc: 0.2614.
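
For reference, a mask-weighted mean sidesteps the nonzero()/gather problem entirely, since masked timesteps contribute zero to both the numerator and the denominator. This sketch is not from the thread and the masked_categorical_accuracy name is an invention, but it only uses ops that exist in both backends:

from keras import backend as K

def masked_categorical_accuracy(y_true, y_pred, mask):
    # 1.0 where the argmax prediction matches the target, else 0.0;
    # shape (batch, timesteps).
    matches = K.cast(K.equal(K.argmax(y_true, axis=-1),
                             K.argmax(y_pred, axis=-1)),
                     K.floatx())
    mask = K.cast(mask, K.floatx())
    # Weighted mean: masked timesteps add 0 to the numerator and 0 to
    # the denominator, so they never dilute the score.
    return K.sum(matches * mask) / K.sum(mask)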

@braingineer
Contributor Author

If anyone knows the equivalent of vector.nonzero() in TensorFlow, I could write a pull request to update the backend and metrics. I also have a perplexity implementation, related to categorical crossentropy, that I could put in as well:

def perplexity(y_true, y_pred, mask=None):
    if mask is not None:
        y_pred /= K.sum(y_pred, axis=-1, keepdims=True)
        mask = K.permute_dimensions(K.reshape(mask, y_true.shape[:-1]), (0, 1, 'x'))
        truth_mask = K.flatten(y_true*mask).nonzero()[0]  ### How do you do this on tensorflow?
        predictions = K.gather(y_pred.flatten(), truth_mask)
        return K.pow(2, K.mean(-K.log2(predictions)))
    else:
        return K.pow(2, K.mean(-K.log2(y_pred)))
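
A backend-agnostic sketch of the same metric (assumptions: y_true is one-hot and mask has shape (batch, timesteps); this is not the code above). Using the natural log with K.exp gives the same perplexity as the base-2 form while avoiding both nonzero() and log2:

from keras import backend as K

def masked_perplexity(y_true, y_pred, mask):
    # Probability assigned to the true class at each timestep,
    # assuming one-hot y_true: shape (batch, timesteps).
    true_probs = K.sum(y_true * y_pred, axis=-1)
    mask = K.cast(mask, K.floatx())
    # Negative log-likelihood, clipped away from log(0).
    nll = -K.log(K.clip(true_probs, K.epsilon(), 1.0))
    # exp of the mask-weighted mean NLL equals 2 ** mean(-log2 p).
    return K.exp(K.sum(nll * mask) / K.sum(mask))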

@braingineer
Contributor Author

Yeah, that's what I did:

https://github.com/braingineer/keras/blob/master/keras/metrics.py#L10

Though it's better if you use the mask that's passed with the network. That requires a commitment inside the main part of Keras to push the mask through, and it involves some operations that I don't know how to implement in TensorFlow, so I haven't pushed them upstream.

@braingineer
Contributor Author

No, I solved it in Theano ages ago (as per my first answer). I just didn't solve it for TensorFlow, which is why this is still open.

Passing the mask to the accuracy calculations should be happening anyway (imo).

@tbsflk

tbsflk commented Nov 23, 2016

I used the implementation provided by @braingineer

def categorical_accuracy(y_true, y_pred, mask=None):
    if mask is not None:
        ...
    else:
        ...

but the problem is that mask is never passed into the function. Do I have to change something else somewhere to make that happen?
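
One possible workaround, assuming a Keras version whose compile() accepts weighted_metrics and sample_weight_mode='temporal': encode the mask as per-timestep sample weights rather than waiting for Keras to hand it to the metric. A sketch, where mask_array is a hypothetical (num_samples, timesteps) array of 0/1 weights:

model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              weighted_metrics=['accuracy'],   # weighted by sample_weight
              sample_weight_mode='temporal')   # one weight per timestep
model.fit(x, y, sample_weight=mask_array)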

bojan-karlas pushed a commit to bojan-karlas/keras that referenced this issue Feb 5, 2017
* Reshape y_pred and y_true from (samples, timesteps, ... ) to (samples * timesteps, ... )

* Filter out masked timesteps from y_pred and y_true

* Added K.where() and extended functionality of K.flatten()

* Added/changed corresponding tests
@stale stale bot added the stale label May 23, 2017
@stale stale bot closed this as completed Jun 22, 2017
@OmniaZayed

Hi @braingineer,
Thank you for sharing your code.
I have the same problem as @tbsflk where the mask is not passed to the function. Any thoughts on that?
