Code accompanying the paper "Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation", accepted at ICLR 2024 for a spotlight presentation. OpenReview. Talk.
We study the problem of model selection in causal inference, specifically for conditional average treatment effect (CATE) estimation. Unlike machine learning, there is no perfect analogue of cross-validation for model selection as we do not observe the counterfactual potential outcomes. Towards this, a variety of surrogate metrics have been proposed for CATE model selection that use only observed data. However, we do not have a good understanding regarding their effectiveness due to limited comparisons in prior studies. We conduct an extensive empirical analysis to benchmark the surrogate model selection metrics introduced in the literature, as well as the novel ones introduced in this work. We ensure a fair comparison by tuning the hyperparameters associated with these metrics via AutoML, and provide more detailed trends by incorporating realistic datasets via generative modeling. Our analysis suggests novel model selection strategies based on careful hyperparameter selection of CATE estimators and causal ensembling.
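As a concrete illustration of the kind of surrogate metric studied here, the sketch below scores candidate CATE models on a validation split using doubly-robust pseudo-outcomes computed from observed data only. This is a generic sketch for intuition, not the exact metrics or nuisance configurations evaluated in the paper.

```python
# Minimal sketch (illustrative, not the repo's implementation): rank candidate CATE
# models by a doubly-robust "tau-risk" computed on a validation split.
import numpy as np
from sklearn.ensemble import RandomForestRegressor, RandomForestClassifier

def doubly_robust_pseudo_outcomes(X, T, Y):
    """Pseudo-outcomes whose conditional mean given X equals the true CATE."""
    mu0 = RandomForestRegressor().fit(X[T == 0], Y[T == 0]).predict(X)
    mu1 = RandomForestRegressor().fit(X[T == 1], Y[T == 1]).predict(X)
    e = RandomForestClassifier().fit(X, T).predict_proba(X)[:, 1]
    e = np.clip(e, 0.05, 0.95)  # guard against extreme propensities
    return mu1 - mu0 + T * (Y - mu1) / e - (1 - T) * (Y - mu0) / (1 - e)

def dr_tau_risk(cate_model, X_val, T_val, Y_val):
    """Lower is better: squared error of predicted CATE against DR pseudo-outcomes."""
    psi = doubly_robust_pseudo_outcomes(X_val, T_val, Y_val)
    return np.mean((cate_model.predict(X_val) - psi) ** 2)

# best = min(candidate_models, key=lambda m: dr_tau_risk(m, X_val, T_val, Y_val))
```

In the paper's pipeline, the nuisance models entering such metrics are themselves tuned via AutoML, which is what the commands below automate.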
A script to reproduce the results of the paper can be executed as follows.
python scripts/reproduce_results.py
Please download the ACIC 2016 datasets and place them in the root_dir/acic_2016/ directory.
- ACIC 2016 benchmark link: https://jenniferhill7.wixsite.com/acic-2016/competition
Use the requirements.txt file for installing the dependencies.
To train the CATE estimators, we describe the commands below for the dataset twins and seed 0. The same commands can be executed for the remaining seed values and datasets.
First, we create the train/val splits for each dataset. For the RealCause datasets, execute the following command.
python generate_date.py --seed 0 --meta_dataset realcause
For the datasets from the ACIC 2016 benchmark, we create the train/val splits by executing the following command.
python generate_date.py --seed 0 --meta_dataset acic
Before training the CATE estimators, we first need to select the corresponding nuisance models via AutoML. To run nuisance model selection for a given dataset and seed value, execute the following command:
python nuisance_model_selection.py --seed 0 --dataset twins
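For intuition, nuisance model selection with AutoML roughly amounts to letting an automated tuner pick the propensity and outcome models on the training split. The sketch below uses FLAML purely as an example tuner; it is an assumption for illustration and does not mirror what nuisance_model_selection.py does internally.

```python
# Illustrative sketch (assumed, not the repo's code): AutoML-tuned nuisance models.
import numpy as np
from flaml import AutoML

def select_nuisance_models(X_train, T_train, Y_train, budget_s=60):
    # Propensity model e(X) = P(T = 1 | X)
    propensity = AutoML()
    propensity.fit(X_train, T_train, task="classification", time_budget=budget_s)

    # Outcome model mu(X, T) = E[Y | X, T], fit on features augmented with treatment
    outcome = AutoML()
    outcome.fit(np.column_stack([X_train, T_train]), Y_train,
                task="regression", time_budget=budget_s)
    return propensity, outcome
```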
After the nuisance model selection, we can execute the following command to train a particular CATE estimator for this dataset and seed.
python train.py --seed 0 --dataset twins --estimator dml_learner
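To give a rough picture of what a dml_learner-style estimator does, the sketch below fits a DML CATE model with EconML's LinearDML on toy data; the repo's own estimator wrappers and selected nuisance models may differ.

```python
# Sketch (assumed configuration, toy data): a DML-style CATE estimator via EconML.
import numpy as np
from econml.dml import LinearDML
from sklearn.ensemble import GradientBoostingRegressor, GradientBoostingClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
T = rng.binomial(1, 0.5, size=500)
Y = X[:, 0] * T + X[:, 1] + rng.normal(size=500)  # heterogeneous effect in X[:, 0]

est = LinearDML(
    model_y=GradientBoostingRegressor(),   # outcome nuisance
    model_t=GradientBoostingClassifier(),  # propensity nuisance
    discrete_treatment=True,
    cv=3,                                  # cross-fitting folds
    random_state=0,
)
est.fit(Y, T, X=X)
tau_hat = est.effect(X)  # estimated CATE for each unit
```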
The complete list of CATE estimators to be trained is as follows: ['dml_learner', 'dr_learner', 'x_learner', 'causal_forest_learner', 's_learner', 't_learner', 's_learner_upd' ]
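To train every estimator in this list for a given dataset and seed in one go, a small driver loop (a hypothetical convenience script, not part of the repo) could look like:

```python
# Hypothetical driver (not included in the repo): train all CATE estimators
# for one dataset/seed by invoking train.py repeatedly.
import subprocess

ESTIMATORS = ['dml_learner', 'dr_learner', 'x_learner', 'causal_forest_learner',
              's_learner', 't_learner', 's_learner_upd']

for estimator in ESTIMATORS:
    subprocess.run(
        ["python", "train.py", "--seed", "0", "--dataset", "twins",
         "--estimator", estimator],
        check=True,
    )
```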
After training all the CATE estimators for this dataset and seed, we train the ensemble CATE estimators.
python ensemble_train.py --seed 0 --dataset twins
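Conceptually, the ensembling step combines the already-trained CATE estimators, for example by weighting them according to a surrogate validation metric. The sketch below shows one simple convex-combination scheme for intuition; it is not necessarily the ensembling rule implemented in ensemble_train.py.

```python
# Sketch (assumed): a validation-risk-weighted ensemble of CATE predictions.
import numpy as np

def ensemble_cate(predictions, val_risks):
    """predictions: dict name -> CATE predictions on the same points.
    val_risks: dict name -> surrogate validation risk (lower is better)."""
    names = list(predictions)
    risks = np.array([val_risks[n] for n in names])
    weights = np.exp(-risks) / np.exp(-risks).sum()  # softmax over negative risk
    return weights @ np.stack([predictions[n] for n in names])
```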