You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+12-1
Original file line number
Diff line number
Diff line change
@@ -26,10 +26,21 @@ You can read our paper "BBQ: A Hand-Built Bias Benchmark for Question Answering"
26
26
-`index` and `cat` columns correspond to the `example_id` and `cateogry` from the data files
27
27
- Values in `ans0`, `ans1`, and `ans2` correspond to the logits for each of the three answer options from the data files
28
28
- supplemental
29
-
- Description: Additional files used in validation and selecting names for the vocabulary
29
+
- Description: Additional files used in validation and selecting names for the vocabulary and additional metadata to make analysis easier
30
30
- Contents:
31
31
- MTurk_validation contains the HIT templates, scripts, input data, and results from our MTurk validations
32
32
- name_job_data contains files downloaded that contain name & demographic information or occupation prestige scores for developing these portions of the vocabulary
33
+
-`additional_metadata.csv`, with the following structure:
34
+
-`category`: the bias category, corresponds to files from the `data` folder
35
+
-`question_id`: the id number of the question, represented in the files in the `data` folder and also in the template files
36
+
-`example_id`: the unique example id within each category, should be used with `category` to merge this file
37
+
-`target_loc`: the index of the answer option that corresponds to the bias target. Used in computing the bias score
38
+
-`label_type`: whether the label used for individuals is an explicit identity `label` or a proper `name`
39
+
-`Known_stereotyped_race` and `Known_stereotyped_var2` are only defined for the intersectional templates. Includes all target race and gender/SES groups for that example
40
+
-`Relevant_social_values` from the template files
41
+
-`corr_ans_aligns_race` and `corr_ans_aligns_var2` are only defined for the intersectional templates. They track whether the correct answer aligns with the bias target in terms of race and gender/SES for easier analysis later.
42
+
-`full_cond` is only defined for the intersectional templates. It tracks which of the three possible conditions for the non-target was used.
43
+
-`Known_stereotyped_groups` is only defined for the non-intersectional templates. Includes all target groups for that example
33
44
- templates
34
45
- Description: This folder contains all the templates and vocabulary used to create BBQ
35
46
- Contents: 11 csv files that contain the templates used in BBQ, 1 csv file listing all filler items used in the validation, 2 csv files for the BBQ vocabulary.
0 commit comments