KDD 2020: Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions

This repo contains the code for the Reward interaction Inverse Propensity Scoring off-policy estimator proposed in the KDD 2020 paper Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions by James McInerney, Brian Brost, Praveen Chandar, Rishabh Mehrotra, Ben Carterette.

This implementation uses Python Beam and Google's Dataflow for running experiments to make it easier to scale to large datasets. However, if you are interested in running a simple simulation experiment, you can do so using the following command (make sure to install the dependencies, see Environment Setup)

PYTHONPATH=./ python run.py [output_path]

The script generates two files for each run. Use the analysis.ipynb notebook to generate the plots similar to the ones in the paper.

Environment Setup

Create a new virtual environment with for your supported Python version. We recommend the use of Anaconda for managing virtual environments. Create a new environment conda create --name rips python=3.7 and switch to the environment using conda activate rips.

$ pip install -r requirements.txt

Reward interaction IPS

The implementation of the Reward interaction IPS (RIPS) can be found in rips/eval/offpolicy/rips.py file.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
rips		rips
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
analysis.ipynb		analysis.ipynb
requirements.txt		requirements.txt
run.py		run.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KDD 2020: Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions

Environment Setup

Reward interaction IPS

About

Releases

Packages

Languages

License

spotify-research/RIPS_KDD2020

Folders and files

Latest commit

History

Repository files navigation

KDD 2020: Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions

Environment Setup

Reward interaction IPS

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages