Implement SyntheticBanditDataWithActionEmbeds and Marginalized IPS #155
Conversation
I'm sorry for the late review...!
My first review mainly focuses on the estimators. In the second review, I will check the testing details.
obp/ope/estimators_embed.py (Outdated)

```python
else:
    exclude_d_idx = np.where(current_feat != exclude_d, True, False)
    return theta_j[-1]
```
[ask]
This line seems to exclude the feature that has the smallest CDF. Is this a reasonable algorithm?
(Intuitively, there may be some reason to exclude the feature with the smallest lower bound.)
Yes, this is intended. `theta_j[-1]` is the last element of `theta_j` (or equivalently the last element of `theta_list`); it is not necessarily the element having the smallest CDF.
> theta_j[-1] is the last element of theta_j (or equivalently the last element of theta_list), this is not necessarily the element having the smallest CDF.

That is OK!
In my understanding, the feature extraction is implemented in the following steps:

1. The last element of `idx_list` indicates the feature that has the smallest CDF among `current_feature` (`idx_list = np.argsort(cnf_list_)[::-1]`).
2. The last element of `idx_list` is excluded if not returned (`idx_without_d = np.where(current_feat != excluded_dim, True, False)`).
3. That is why the feature that has the smallest CDF seems to be excluded.

Are these steps correct? And I cannot understand why the last element of `idx_list` is excluded in step 3.
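For concreteness, the steps above can be sketched as follows. This is only an illustrative stand-in for the actual code in `estimators_embed.py`: the helper name `exclude_smallest_cdf_feature` and the example values are assumptions, not the real implementation.

```python
import numpy as np

def exclude_smallest_cdf_feature(current_feat, cnf_list_):
    """Illustrative sketch of the three steps discussed above.

    current_feat: indices of the embedding dimensions still in play.
    cnf_list_: confidence (CDF-related) value obtained when the
        corresponding dimension is excluded.
    """
    # Step 1: sort in descending order, so the LAST element of idx_list
    # points at the dimension with the smallest value in cnf_list_.
    idx_list = np.argsort(cnf_list_)[::-1]
    excluded_dim = current_feat[idx_list[-1]]
    # Step 2: build a boolean mask that drops that dimension.
    idx_without_d = np.where(current_feat != excluded_dim, True, False)
    # Step 3: the surviving dimensions no longer contain excluded_dim,
    # i.e. the smallest-CDF feature has been excluded.
    return current_feat[idx_without_d]

current_feat = np.array([0, 1, 2, 3])
cnf_list_ = np.array([0.4, 0.1, 0.9, 0.3])  # dimension 1 has the smallest value
print(exclude_smallest_cdf_feature(current_feat, cnf_list_))  # -> [0 2 3]
```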
@fullflu Thanks for clarifying your point!

> The last element of idx_list indicates the feature that has the smallest CDF among the current_feature

Let me first clarify that the last element of `idx_list` is the feature that leads to the smallest CDF when it is excluded from OPE. And if all features pass the criterion `(np.abs(theta_j - theta_i) <= cnf_i + C * cnf_j).all()`, we know that we can exclude any feature from `current_feature` and proceed to the next round. Then, I think we should exclude the feature that leads to the smallest CDF, as it has already satisfied the criterion. (If this is not true, which feature do you think is appropriate to exclude when proceeding to the next iteration?)
Thank you for your thorough explanation!
I read the algorithm again, and I found that the process has no problem!
(I think that excluding the feature that leads to the smallest CDF or the largest CDF might both be greedy choices, and we cannot decide which one is better.)
@fullflu Thanks for checking again!

> the smallest CDF or the largest CDF might be a greedy algorithm, and we cannot decide which one is better.

I agree, but I thought the feature that leads to the smallest CDF should be removed, because passing `(np.abs(theta_j - theta_i) <= cnf_i + C * cnf_j).all()` is the most difficult with the smallest CDF; if the feature with the smallest CDF passes the criterion, then we can be aggressive and simply remove that feature.
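A tiny numeric illustration of why a small `cnf_j` makes the criterion hardest to pass: the allowed deviation `cnf_i + C * cnf_j` shrinks with `cnf_j`. All values below are made up for illustration; they are not from the estimator.

```python
import numpy as np

# Illustrative stand-ins: candidate estimates and their confidence widths.
C = 1.0
theta_i = np.array([1.00, 1.02, 0.98])
cnf_i = np.array([0.01, 0.01, 0.01])

def passes_criterion(theta_j, cnf_j):
    # The deviation test from the discussion: every candidate estimate
    # theta_i must lie within cnf_i + C * cnf_j of theta_j.
    return bool((np.abs(theta_j - theta_i) <= cnf_i + C * cnf_j).all())

# A small cnf_j makes the tolerance tight, so it is the hardest to pass;
# if even that feature passes, removing it is the "aggressive" choice.
print(passes_criterion(theta_j=1.01, cnf_j=0.01))  # -> False (tight tolerance)
print(passes_criterion(theta_j=1.01, cnf_j=0.10))  # -> True (loose tolerance)
```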
@fullflu Thanks for the detailed feedback!
Overview

This PR implements the following:

- `obp.ope.MarginalizedInverseProbabilityWeighting` and `obp.ope.SelfNormalizedMarginalizedInverseProbabilityWeighting`. The first one is the main proposal of the reference paper (called MIPS in that paper), and the latter is a trivial extension.
- `obp.dataset.SyntheticBanditDatasetWithActionEmbeds`, which generates synthetic bandit data with action embeddings consisting of some discrete random variables (e.g., category information of movies). The data generating process of this class follows the synthetic experiment conducted in the reference paper.

Reference
Yuta Saito and Thorsten Joachims.
"Off-Policy Evaluation for Large Action Spaces via Embeddings.", 2022.
https://arxiv.org/abs/2202.06317
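As a rough, self-contained sketch of what the MIPS estimator computes: the vanilla importance weight `pi_e(a|x) / pi_b(a|x)` is replaced by the marginal weight `p(e|pi_e) / p(e|pi_b)` over action embeddings `e`. This minimal numpy version assumes context-free policies and a known embedding distribution `p(e|a)`; it is an illustration of the idea, not the obp API.

```python
import numpy as np

rng = np.random.default_rng(0)
n_rounds, n_actions, n_embed = 5000, 20, 4

# Assumed-known embedding distribution p(e|a) and two context-free policies.
p_e_given_a = rng.dirichlet(np.ones(n_embed), size=n_actions)  # (n_actions, n_embed)
pi_b = rng.dirichlet(np.ones(n_actions))  # behavior policy
pi_e = rng.dirichlet(np.ones(n_actions))  # evaluation policy

# Log bandit data under pi_b: actions, embeddings e ~ p(e|a), binary rewards.
actions = rng.choice(n_actions, size=n_rounds, p=pi_b)
embeds = np.array([rng.choice(n_embed, p=p_e_given_a[a]) for a in actions])
q_e = np.linspace(0.1, 0.9, n_embed)  # reward depends on the action only via e
rewards = rng.binomial(1, q_e[embeds])

# Marginal importance weight w(e) = p(e|pi_e) / p(e|pi_b): marginalizing the
# action out of the weight is what shrinks variance in large action spaces.
p_e_pi_e = pi_e @ p_e_given_a
p_e_pi_b = pi_b @ p_e_given_a
w = p_e_pi_e[embeds] / p_e_pi_b[embeds]

mips_estimate = float(np.mean(w * rewards))
true_value = float(p_e_pi_e @ q_e)
print(mips_estimate, true_value)  # the estimate should be close to the true value
```

The self-normalized variant simply divides `np.sum(w * rewards)` by `np.sum(w)` instead of using the plain mean.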