Skip to content

Commit 40691e3

Browse files
Update README.md
1 parent 9eba1ba commit 40691e3

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

README.md

+12-1
Original file line numberDiff line numberDiff line change
@@ -26,10 +26,21 @@ You can read our paper "BBQ: A Hand-Built Bias Benchmark for Question Answering"
2626
- `index` and `cat` columns correspond to the `example_id` and `cateogry` from the data files
2727
- Values in `ans0`, `ans1`, and `ans2` correspond to the logits for each of the three answer options from the data files
2828
- supplemental
29-
- Description: Additional files used in validation and selecting names for the vocabulary
29+
- Description: Additional files used in validation and selecting names for the vocabulary and additional metadata to make analysis easier
3030
- Contents:
3131
- MTurk_validation contains the HIT templates, scripts, input data, and results from our MTurk validations
3232
- name_job_data contains files downloaded that contain name & demographic information or occupation prestige scores for developing these portions of the vocabulary
33+
- `additional_metadata.csv`, with the following structure:
34+
- `category`: the bias category, corresponds to files from the `data` folder
35+
- `question_id`: the id number of the question, represented in the files in the `data` folder and also in the template files
36+
- `example_id`: the unique example id within each category, should be used with `category` to merge this file
37+
- `target_loc`: the index of the answer option that corresponds to the bias target. Used in computing the bias score
38+
- `label_type`: whether the label used for individuals is an explicit identity `label` or a proper `name`
39+
- `Known_stereotyped_race` and `Known_stereotyped_var2` are only defined for the intersectional templates. Includes all target race and gender/SES groups for that example
40+
- `Relevant_social_values` from the template files
41+
- `corr_ans_aligns_race` and `corr_ans_aligns_var2` are only defined for the intersectional templates. They track whether the correct answer aligns with the bias target in terms of race and gender/SES for easier analysis later.
42+
- `full_cond` is only defined for the intersectional templates. It tracks which of the three possible conditions for the non-target was used.
43+
- `Known_stereotyped_groups` is only defined for the non-intersectional templates. Includes all target groups for that example
3344
- templates
3445
- Description: This folder contains all the templates and vocabulary used to create BBQ
3546
- Contents: 11 csv files that contain the templates used in BBQ, 1 csv file listing all filler items used in the validation, 2 csv files for the BBQ vocabulary.

0 commit comments

Comments
 (0)