Hard thresholding regression

Scand Stat Theory Appl. 2019 Mar;46(1):314-328. doi: 10.1111/sjos.12353. Epub 2018 Sep 24.

Abstract

In this paper, we propose the hard thresholding regression (HTR) for estimating high-dimensional sparse linear regression models. HTR uses a two-stage convex algorithm to approximate the 0-penalized regression: The first stage calculates a coarse initial estimator, and the second stage identifies the oracle estimator by borrowing information from the first one. Theoretically, the HTR estimator achieves the strong oracle property over a wide range of regularization parameters. Numerical examples and a real data example lend further support to our proposed methodology.

Keywords: Lasso; best subset selection; linear programming; oracle property; sparsity; variable selection.