Home
Scholarly Works
Best-subset instrumental variable selection method...
Journal article

Best-subset instrumental variable selection method using mixed integer optimization with applications to health-related quality of life and education–wage analyses

Abstract

The classical best-subset selection method has been demonstrated to be nondeterministic polynomial-time-hard and thus presents computational challenges. This problem can now be solved via advanced mixed integer optimization (MIO) algorithms for linear regression. We extend this methodology to linear instrumental variable (IV) regression and propose the best-subset instrumental variable (BSIV) method incorporating the MIO procedure. Classical IV estimation methods assume that IVs must not directly impact the outcome variable and should remain uncorrelated with nonmeasured variables. However, in practice, IVs are likely to be invalid, and existing methods can lead to a large bias relative to standard errors in certain situations. The proposed BSIV estimator is robust in estimating causal effects in the presence of unknown IV validity. We demonstrate that the BSIV using MIO algorithms outperforms two-stage least squares, Lasso-type IVs, and two-sample analysis (median and mode estimators) through Monte Carlo simulations in terms of bias and relative efficiency. We analyze two datasets involving the health-related quality of life index and proximity and the education–wage relationship to demonstrate the utility of the proposed method.

Authors

Qasim M; Månsson K; Balakrishnan N

Journal

Statistics and Computing, Vol. 36, No. 1,

Publisher

Springer Nature

Publication Date

February 1, 2026

DOI

10.1007/s11222-025-10760-1

ISSN

0960-3174

Contact the Experts team