You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+9-9Lines changed: 9 additions & 9 deletions
Original file line number
Diff line number
Diff line change
@@ -21,7 +21,7 @@ Data and code for this research project are distributed across different places.
21
21
22
22
23
23
## The Lexicon
24
-
We created emotion lexicons for 91 languages, each one covers eight emotional variables and comprises over 100k word entries. There are several versions of the lexicons, the difference being the choice of the expansion model: There is a linear regression baseline and three versions of neural network models. The *main* version of our lexicons (the version we refer to in the main experiments of our paper and the one we would recommend to use) is referred to as as **MTL_grouped** (applying multi-task learning within two groups of our target variables). **If you are mainly interested in our lexicons, download [this](https://zenodo.org/record/3756607/files/MTL_grouped.zip?download=1) zip file.** It contains 91 tsv files which are named `<iso language code>.tsv`. Please refer to the [description of the Zenodo record](https://doi.org/10.5281/zenodo.3756606) for more details.
24
+
We created emotion lexicons for 91 languages, each one covers eight emotional variables and comprises over 100k word entries. There are several versions of the lexicons, the difference being the choice of the expansion model: There is a linear regression baseline and three versions of neural network models. The *main* version of our lexicons (the version we refer to in the main experiments of our paper and the one we would recommend to use) is referred to as as **MTL_grouped** (applying multi-task learning within two groups of our target variables). **If you are mainly interested in our lexicons, download [this](https://zenodo.org/record/3756607/files/MTL_grouped.zip?download=1) zip file.** It contains 91 tsv files which are named `<iso language code>.tsv`. Please refer to the [description of the Zenodo record](https://doi.org/10.5281/zenodo.3756606) for more details.
25
25
26
26
27
27
@@ -60,7 +60,7 @@ Recreating the lexicons from scratch requires the Source lexicon, data splits, a
60
60
61
61
* Get the file [Ratings_Warriner_et_al.csv](https://github.com/JULIELab/XANEW/blob/master/Ratings_Warriner_et_al.csv) (commit b1ed97e from 11 Nov 2019) and place it in `/memolon/data/Source`.
62
62
* Get the file [Warriner_BE.tsv](https://github.com/JULIELab/EmoMap/blob/master/coling18/main/lexicon_creation/lexicons/Warriner_BE.tsv) (commit dbfa3b9 from 15 Jun 2018) and place it in ``/memolon/data/Source``.
63
-
63
+
64
64
65
65
66
66
The python scripts for creating the lexicons can be found in `/memolon/src`. You can either `cd` there and simply run `run_all.sh` or follow the more detailed instructions below. Please take note that the whole process may take several hours. **You do not have to have a GPU to run our code in a reasonable amount of time.**
@@ -96,31 +96,31 @@ Running the gold evaluation and related analyses requires you to manually collec
96
96
and place it in `/memolon/data/TargetGold`.
97
97
* es1. Get the file `Redondo(2007).xls` from [Redondo et al. (2007)](https://doi.org/10.3758/BF03193031) and place it `/memolon/data/TargetGold`.
98
98
* es2. Get the file `13428_2015_700_MOESM1_ESM.csv` from [Stadthagen-Gonzalez et al. (2017)](https://doi.org/10.3758/BF03192999) and save it as `/memolon/data/TargetGold/Stadthagen_VA.csv`
99
-
* es3. Get the file `Hinojosa et al_Supplementary materials.xlsx` from [Hinojosa et al., (2015)](https://link.springer.com/article/10.3758%2Fs13428-015-0572-5) and place it in `/memolon/data/TargetGold`.
99
+
* es3. Get the file `Hinojosa et al_Supplementary materials.xlsx` from [Hinojosa et al. (2015)](https://link.springer.com/article/10.3758%2Fs13428-015-0572-5) and place it in `/memolon/data/TargetGold`.
100
100
* es4. Included in the download for es3.
101
101
* es5. Get the file `13428_2017_962_MOESM1_ESM.csv` from [Stadthagen-Gonzalez et al. (2018)](https://doi.org/10.3758/s13428-017-0962-y) and save it as `/memolon/data/TargetGold/Stadthagen_BE.csv`.
102
-
* es6. Get the file `13428_2016_768_MOESM1_ESM.xls` from [Ferre et al. (2017)](https://doi.org/10.3758/s13428-016-0768-3) ad save it as `/memolon/data/TargetGold/Ferre.xlsx`.
103
-
* de1. Get the file `13428_2013_426_MOESM1_ESM.xlsx` from [Schmidtke et al., 2014](https://doi.org/10.3758/s13428-013-0426-y) and save it as `/memolon/data/TargetGold/Schmidtke.xlsx`
102
+
* es6. Get the file `13428_2016_768_MOESM1_ESM.xls` from [Ferré et al. (2017)](https://doi.org/10.3758/s13428-016-0768-3) ad save it as `/memolon/data/TargetGold/Ferre.xlsx`.
103
+
* de1. Get the file `13428_2013_426_MOESM1_ESM.xlsx` from [Schmidtke et al. (2014)](https://doi.org/10.3758/s13428-013-0426-y) and save it as `/memolon/data/TargetGold/Schmidtke.xlsx`
104
104
* de2. Get the file `BAWL-R.xls` from [Vo et al. (2009)](https://doi.org/10.3758/BRM.41.2.534) which is currently available
105
105
[here](https://www.ewi-psy.fu-berlin.de/einrichtungen/arbeitsbereiche/allgpsy/Download/BAWL/index.html). You will need to request a password from the authors. Save the file **without password** as `/memolon/data/TargetGold/BAWL-R.xls`. We had to run an automatic file repair when oping it with Excel for the first time.
106
106
* de3. Get the file `LANG_database.txt` from [Kaske and Kotz (2010)](https://doi.org/10.3758/BRM.42.4.987) and place it `/memolon/data/TargetGold`.
107
107
* de4. Get de2 (see above). Then, get the file `13428_2011_59_MOESM1_ESM.xls` from [Briesemeister et al. (2011)](https://doi.org/10.3758/s13428-011-0059-y) and save it as `/memolon/data/TargetGold/Briesemeister.xls`.
108
-
* pl1. Get the file `data sheet 1.xlsx` from [Imbir 2016](https://doi.org/10.3389/fpsyg.2016.01081) and save it as `/memolon/data/TargetGold/Imbir.xlsx`.
108
+
* pl1. Get the file `data sheet 1.xlsx` from [Imbir (2016)](https://doi.org/10.3389/fpsyg.2016.01081) and save it as `/memolon/data/TargetGold/Imbir.xlsx`.
109
109
* pl2. Get the file `13428_2014_552_MOESM1_ESM.xlsx` from [Riegel et al. (2015)](https://doi.org/10.3758/s13428-014-0552-1) and save it as `/memolon/data/TargetGold/Riegel.xlsx`
110
110
* pl3. Get pl2 (see above). Then, get the file `S1 Dataset` from [Wierzba et al. (2015)](https://doi.org/10.1371/journal.pone.0132305)
111
111
and save it as `/memolon/data/TargetGold/Wierzba.xlsx`.
112
-
* zh1. Get CVAW 2.0 from [Yu et al. 2016](https://doi.org/10.18653/v1/N16-1066) which is distributed via
112
+
* zh1. Get CVAW 2.0 from [Yu et al. (2016)](https://doi.org/10.18653/v1/N16-1066) which is distributed via
113
113
[this website](http://nlp.innobic.yzu.edu.tw/resources/cvaw.html). Use Google Translate to 'translate' the words in `cvaw2.csv`
114
114
from traditional to simplified Chinese characters (you can batch-translate by copy-pasting multiple words separated by newline directly from the file). Save the modified file as `/memolon/data/TargetGold/cvaw2_simplied.csv`.
115
-
* zh2. Get the file `13428_2016_793_MOESM2_ESM.pdf` from [Yao et al. 2017](https://doi.org/10.3758/s13428-016-0793-2). Convert PDF to Excel (there are online tools for that but check the results for correctness) and save as `/memolon/data/TargetGold/Yao.xlsx`.
115
+
* zh2. Get the file `13428_2016_793_MOESM2_ESM.pdf` from [Yao et al. (2017)](https://doi.org/10.3758/s13428-016-0793-2). Convert PDF to Excel (there are online tools for that but check the results for correctness) and save as `/memolon/data/TargetGold/Yao.xlsx`.
116
116
* it. Get the data from [Montefinese et al. (2014)](https://doi.org/10.3758/s13428-013-0405-3). The website offers a PDF version
117
117
of the ratings. However, the formatting makes it very difficult to process automatically. Instead, the first author Maria Montefinese provided us with an Excel version. Save the ratings as `/memolon/data/TargetGold/Montefinese.xls`.
118
118
* pt. Get the file `13428_2011_131_MOESM1_ESM.xls` from [Soares et al. (2012)](https://doi.org/10.3758/s13428-011-0131-7).
119
119
Save it as `/memolon/data/TargetGold/Soares.xls`.
120
120
* nl. Get the file `13428_2012_243_MOESM1_ESM.xlsx` from [Moors et al. (2013)](https://doi.org/10.3758/s13428-012-0243-8).
121
121
Save it as `/memolon/data/TargetGold/Moors.xlsx`.
122
122
* id. Get the file `Data Sheet 1.XLSX` from [Sianipar et al. (2016)](https://doi.org/10.3389/fpsyg.2016.01907). Save it as `/memolon/data/TargetGold/Sianipar.xlsx`
123
-
* el. Get the data from [Palogiannidi et a. (2016)](https://www.aclweb.org/anthology/L16-1458): We downloaded the ratings via the [link](www.telecom.tuc.gr/~epalogiannidi/docs/resources/greek_affective_lexicon.zip)
123
+
* el. Get the data from [Palogiannidi et al. (2016)](https://www.aclweb.org/anthology/L16-1458): We downloaded the ratings via the [link](www.telecom.tuc.gr/~epalogiannidi/docs/resources/greek_affective_lexicon.zip)
124
124
provided in the paper on March 13, 2018. The link pointed to zip containing a single file `greek_affective_lexicon.csv` which we saved under `/memolon/data/TargetGold`. However, the original link does not work anymore (as of April 22, 2020). We recommend contacting the authors for a replacement.
125
125
* tr1. Get the file `TurkishEmotionalWordNorms.csv` from [Kapucu et al. (2018)](https://doi.org/10.1177/0033294118814722)
126
126
which is available [here](https://osf.io/rxtdm/). Place it under `/memolon/data/TargetGold`.
0 commit comments