Comparison of random forest methods for conditional average treatment effect estimation with a continuous treatment

Stat Methods Med Res. 2024 Nov;33(11-12):1952-1966. doi: 10.1177/09622802241275401. Epub 2024 Oct 9.

Abstract

We are addressing the problem of estimating conditional average treatment effects with a continuous treatment and a continuous response, using random forests. We explore two general approaches: building trees with a split rule that seeks to increase the heterogeneity of the treatment effect estimation and building trees to predict Y as a proxy target variable. We conduct a simulation study to investigate several aspects including the presence or absence of confounding and colliding effects and the merits of locally centering the treatment and/or the response. Our study incorporates both existing and new implementations of random forests. The results indicate that locally centering both the response and treatment variables is generally the best strategy, and both general approaches are viable. Additionally, we provide an illustration using data from the 1987 National Medical Expenditure Survey.

Keywords: Conditional average treatment effect (CATE); causal modeling; colliding effect; confounding effect; continuous treatment; ensemble method; incremental modeling; local centering; random forest; tree-based method; uplift modeling.

Publication types

  • Comparative Study

MeSH terms

  • Computer Simulation
  • Humans
  • Models, Statistical*
  • Random Forest
  • Treatment Outcome