Description
System Info
transformers==4.20.1, torch==1.9.0, tensorflow==2.9
Who can help?
No response
Information
- The official example scripts
- My own modified scripts
Tasks
- An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
- My own task or dataset (give details below)
Reproduction
Steps to reproduce the behavior
- Download the BERT-large weights fine-tuned on SQuAD v1.1 from: https://huggingface.co/bert-large-uncased-whole-word-masking-finetuned-squad
- Successfully reproduce the inference F1 score by running the PyTorch example (`examples/pytorch/question-answering/run_qa.py`).
- But fail to reproduce the inference F1 score by running the TensorFlow example (`examples/tensorflow/question-answering/run_qa.py`).
- The reason is that the TensorFlow example is missing the `token_type_ids` input. Without it, the model presumably falls back to all-zero token type IDs, so the question/context segmentation that BERT's segment embeddings encode is lost. I added this input at the following position to solve the problem (see the sketch after the code block):
https://github.com/huggingface/transformers/blob/main/examples/tensorflow/question-answering/run_qa.py#L640
```python
tensor_keys = ["attention_mask", "token_type_ids", "input_ids"]
eval_inputs = {
    "input_ids": tf.ragged.constant(processed_datasets["validation"]["input_ids"]).to_tensor(),
    "token_type_ids": tf.ragged.constant(processed_datasets["validation"]["token_type_ids"]).to_tensor(),
    "attention_mask": tf.ragged.constant(processed_datasets["validation"]["attention_mask"]).to_tensor(),
}
predict_inputs = {
    "input_ids": tf.ragged.constant(processed_datasets["test"]["input_ids"]).to_tensor(),
    "token_type_ids": tf.ragged.constant(processed_datasets["test"]["token_type_ids"]).to_tensor(),
    "attention_mask": tf.ragged.constant(processed_datasets["test"]["attention_mask"]).to_tensor(),
}
```
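To illustrate why this input matters (a minimal sketch, not part of the example script; the question/context strings are invented), the tokenizer for this checkpoint emits `token_type_ids` that mark question tokens as segment 0 and context tokens as segment 1, and the fine-tuned model expects that segmentation:

```python
# Minimal sketch: what token_type_ids encode for a BERT question/context pair.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "bert-large-uncased-whole-word-masking-finetuned-squad"
)

enc = tokenizer("Who wrote Hamlet?", "Hamlet is a play by Shakespeare.")
# Question tokens (plus [CLS]/[SEP]) get segment 0, context tokens segment 1.
print(enc["token_type_ids"])  # [0, 0, ..., 0, 1, 1, ..., 1]
```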
Expected behavior
Both the PyTorch and TensorFlow examples produce the same F1 score with these weights.
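For a quick parity check (a sketch under assumptions, not from this report: the input text is invented and it compares raw logits rather than F1), the two checkpoints can be run side by side on one example with `token_type_ids` passed to both:

```python
# Sketch: compare PyTorch and TF start/end logits for a single QA input.
import numpy as np
import torch
from transformers import (
    AutoTokenizer,
    BertForQuestionAnswering,
    TFBertForQuestionAnswering,
)

name = "bert-large-uncased-whole-word-masking-finetuned-squad"
tokenizer = AutoTokenizer.from_pretrained(name)
pt_model = BertForQuestionAnswering.from_pretrained(name)
tf_model = TFBertForQuestionAnswering.from_pretrained(name)

# return_tensors="np" keeps input_ids, token_type_ids and attention_mask.
enc = tokenizer("Who wrote Hamlet?", "Hamlet is a play by Shakespeare.",
                return_tensors="np")

with torch.no_grad():
    pt_out = pt_model(**{k: torch.tensor(v) for k, v in enc.items()})
tf_out = tf_model(dict(enc))

# With token_type_ids fed to both models the logits should agree closely;
# dropping the key from the TF inputs makes them diverge.
print(np.abs(pt_out.start_logits.numpy() - tf_out.start_logits.numpy()).max())
print(np.abs(pt_out.end_logits.numpy() - tf_out.end_logits.numpy()).max())
```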