Source code for the SIGIR 2023 paper An Offline Metric for the Debiasedness of Click Models
. For a standalone implementation of the proposed CMIP metric, see this repository.
Dependency management
- Setup conda / miniconda on your device.
- Create environment and install dependencies:
conda env create -f environment.yaml
- Activating environment:
conda activate sigir-cmip
All experimental runs are documented inside the scripts/
directory. To execute an experiment:
- Make the scripts executable:
chmod +x ./scripts/*
- Run a script locally use, e.g.:
./scripts/graded-pbm.sh
- To execute a script on a SLURM cluster add:
./scripts/graded-pbm.sh +launcher=slurm
- You can configure the SLURM resources in:
config/launcher/slurm.yaml
Documentation of each experiment can be found inside the scripts.
Automatically format and lint modified files in commit.
- Make sure you activate your environment
- Initialize pre-commit:
pre-commit install
- (Optional) Run on checks against all files (not just
changed):
pre-commit run --all-files
The project automatically downloads the dataset used in this work to: ~/.ltr_datasets
.
- You can change the directory by modifying the
base_dir
variable in:config/env.yaml
- To avoid downloading datasets, you can directly place the original .zip file into
the
download
subdirectory, e.g.:~/.ltr_datasets/download/MSLR-WEB30K.zip
Log metrics with Weights & Biases.
- Make sure you activate your environment
- Log into Weights & Biases before your first run:
wandb login
- Add your wandb entity and project name inside
config/config.yaml
All code for plotting is in the notebooks/
directory. The code requires the results to be logged to Weights & Biases (Section 5).
- Make sure you activate your environment
- Start a jupyterlab server:
python -m jupyterlab
- Add wandb parameters in notebook header and run all cells
You can find a list of model parameters and training configurations under config/
.
@inproceedings{Deffayet2023Debiasedness,
author = {Romain Deffayet and Philipp Hager and Jean-Michel Renders and Maarten de Rijke},
title = {An Offline Metric for the Debiasedness of Click Models},
booktitle = {Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR`23)},
organization = {ACM},
year = {2023},
}
This project uses the MIT license.