You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<summary><strong>Table of Contents</strong></summary>
18
19
19
-
-[Open Bandit Pipeline: a research framework for bandit algorithms and off-policy evaluation](#open-bandit-pipeline-a-research-framework-for-bandit-algorithms-and-off-policy-evaluation)
20
+
-[Open Bandit Pipeline: a research framework for off-policy evaluation and learning](#open-bandit-pipeline-a-research-framework-for-bandit-algorithms-and-off-policy-evaluation)
-[Algorithms and OPE Estimators Supported](#algorithms-and-ope-estimators-supported)
24
-
-[Topics and Tasks](#topics-and-tasks)
25
25
-[Installation](#installation)
26
26
-[Usage](#usage)
27
27
-[(1) Data loading and preprocessing](#1-data-loading-and-preprocessing)
@@ -44,7 +44,7 @@
44
44
*Open Bandit Dataset* is a public real-world logged bandit dataset.
45
45
This dataset is provided by [ZOZO, Inc.](https://corp.zozo.com/en/about/profile/), the largest fashion e-commerce company in Japan.
46
46
The company uses some multi-armed bandit algorithms to recommend fashion items to users in a large-scale fashion e-commerce platform called [ZOZOTOWN](https://zozo.jp/).
47
-
The following figure presents examples of displayed fashion items as actions.
47
+
The following figure presents the displayed fashion items as actions.
48
48
The figure shows that there are three *positions* in the data.
@@ -56,7 +56,7 @@ The figure shows that there are three *positions* in the data.
56
56
57
57
We collected the dataset in a 7-day experiment in late November 2019 on three “campaigns,” corresponding to all, men's, and women's items, respectively.
58
58
Each campaign randomly used either the Uniform Random policy or the Bernoulli Thompson Sampling (Bernoulli TS) policy for the data collection.
59
-
This dataset is unique in that it contains a set of *multiple* logged bandit datasets collected by running different policies on the same platform. This enables realistic and reproducible experimental comparisons of different OPE estimators for the first time (see Section 5 of the reference [paper](https://arxiv.org/abs/2008.07146) or the package [documentation](https://zr-obp.readthedocs.io/en/latest/evaluation_ope.html) for the details of the evaluation of OPE protocol with Open Bandit Dataset).
59
+
Open Bandit Dataset is unique in that it contains a set of *multiple* logged bandit datasets collected by running different policies on the same platform. This enables realistic and reproducible experimental comparisons of different OPE estimators for the first time (see Section 5 of the reference [paper](https://arxiv.org/abs/2008.07146) or the package [documentation](https://zr-obp.readthedocs.io/en/latest/evaluation_ope.html) for the details of the evaluation of OPE protocol with Open Bandit Dataset).
Please refer to Section 2/Appendix of the reference [paper](https://arxiv.org/abs/2008.07146) or the package [documentation](https://zr-obp.readthedocs.io/en/latest/ope.html) for the basic formulation of OPE and the definitions of supported OPE estimators.
140
+
Please refer to Section 2/Appendix of the reference [paper](https://arxiv.org/abs/2008.07146) or the package [documentation](https://zr-obp.readthedocs.io/en/latest/ope.html) for the basic formulation of OPE and the supported estimators.
138
141
Note that, in addition to the above algorithms and estimators, Open Bandit Pipeline provides flexible interfaces.
139
142
Therefore, researchers can easily implement their own algorithms or estimators and evaluate them with our data and pipeline.
140
143
Moreover, Open Bandit Pipeline provides an interface for handling real-world logged bandit data.
141
144
Thus, practitioners can combine their own real-world data with Open Bandit Pipeline and easily evaluate bandit algorithms' performance in their settings with OPE.
142
145
143
146
144
-
## Topics and Tasks
145
-
Open Bandit Dataset and Pipeline facilitate the following research topics or practical tasks.
146
-
147
-
### Research
148
-
149
-
Researchers can evaluate the performance of their bandit algorithms (in bandit papers) or the accuracy of their OPE estimators (in OPE papers) in an easy, standardized manner with Open Bandit Pipeline. One can implement these types of experiments for their research papers using synthetic bandit data, multi-class classification data, or the real-world Open Bandit Dataset.
150
-
151
-
-**Evaluation of Bandit Algorithms with Synthetic/Classification/Open Bandit Data**
152
-
-**Evaluation of OPE with Synthetic/Classification/Open Bandit Data**
153
-
154
-
In particular, we prepare some example experiments about the evaluation and comparison of OPE estimators in [examples](./examples/).
155
-
156
-
### Practice
157
-
158
-
Practitioners can improve their automated decision making systems using online/batch bandit policies implemented in the policy module. Moreover, they can easily evaluate such bandit policies using historical logged bandit data and OPE without A/B testing. Specifically, one can implement OPE of batch bandit algorithms with the standard OPE procedure introduced in [examples/quickstart/obd.ipynb](./examples/quickstart/obd.ipynb).
-**Off-Policy Evaluation of Online Bandit Algorithms**
162
-
-**Off-Policy Evaluation of Offline(Batch) Bandit Algorithms**
163
-
164
-
165
147
# Installation
166
148
167
149
You can install OBP using Python's package manager `pip`.
@@ -179,7 +161,7 @@ python setup.py install
179
161
180
162
Open Bandit Pipeline supports Python 3.7 or newer. See [pyproject.toml](./pyproject.toml) for other requirements.
181
163
182
-
# Usage Examples
164
+
# Usage
183
165
184
166
## Example with Synthetic Bandit Data
185
167
@@ -343,6 +325,8 @@ Bibtex:
343
325
}
344
326
```
345
327
328
+
The paper has been accepted at *NeurIPS2021 Datasets and Benchmarks Track*. The camera-ready version of the paper is available [here](https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/33e75ff09dd601bbe69f351039152189-Abstract-round2.html).
329
+
346
330
# Google Group
347
331
If you are interested in the Open Bandit Project, you can follow the updates at its google group: https://groups.google.com/g/open-bandit-project
0 commit comments