Best Subset Selection for Linear Regression Models via Mixed-Integer Optimization
-
- Takano Yuichi
- 筑波大学システム情報系
-
- Miyashiro Ryuhei
- 東京農工大学大学院工学研究院
Bibliographic Information
- Other Title
-
- 混合整数最適化による線形回帰モデルの最良変数選択
- コンゴウ セイスウ サイテキカ ニ ヨル センケイ カイキ モデル ノ サイリョウ ヘンスウ センタク
Search this article
Description
<p>Subset selection for regression models has long been recognized as an important task in statistics, and recently it has been actively studied in data mining and machine learning because of the increased amount of data handled. A mixed-integer optimization approach, which has received renewed attention for subset selection, is to formulate subset selection as a mathematical optimization problem and to solve it using a branch-and-bound method. The greatest advantage of this approach is that the best subset of explanatory variables can be selected with respect to the objective function (i.e., evaluation criteria of regression models) of the optimization problem. The authors have devised several formulations for simultaneously optimizing a subset of variables and its cardinality in terms of statistical criteria such as Mallows' Cp, adjusted R2, information criteria, and cross-validation criterion. In this paper, we explain these mixed-integer optimization formulations for best subset selection in linear regression models.</p>
Journal
-
- Journal of the Japan Statistical Society, Japanese Issue
-
Journal of the Japan Statistical Society, Japanese Issue 50 (2), 343-362, 2021-03-05
Japan Statistical Society
- Tweet
Details 詳細情報について
-
- CRID
- 1390005822567627776
-
- NII Article ID
- 130007995099
-
- NII Book ID
- AA1105098X
-
- ISSN
- 21891478
- 03895602
-
- NDL BIB ID
- 031362130
-
- Text Lang
- ja
-
- Article Type
- journal article
-
- Data Source
-
- JaLC
- NDL Search
- CiNii Articles
- KAKEN
-
- Abstract License Flag
- Disallowed