A Comparative Performance Assessment of Ensemble Learning for Credit Scoring

  • Yiheng Li
    College of Management and Economics, Tianjin University, Tianjin 300072, China
  • Weidong Chen
    College of Management and Economics, Tianjin University, Tianjin 300072, China

書誌事項

公開日
2020-10-13
権利情報
  • https://creativecommons.org/licenses/by/4.0/
DOI
  • 10.3390/math8101756
公開者
MDPI AG

説明

<jats:p>Extensive research has been performed by organizations and academics on models for credit scoring, an important financial management activity. With novel machine learning models continue to be proposed, ensemble learning has been introduced into the application of credit scoring, several researches have addressed the supremacy of ensemble learning. In this research, we provide a comparative performance evaluation of ensemble algorithms, i.e., random forest, AdaBoost, XGBoost, LightGBM and Stacking, in terms of accuracy (ACC), area under the curve (AUC), Kolmogorov–Smirnov statistic (KS), Brier score (BS), and model operating time in terms of credit scoring. Moreover, five popular baseline classifiers, i.e., neural network (NN), decision tree (DT), logistic regression (LR), Naïve Bayes (NB), and support vector machine (SVM) are considered to be benchmarks. Experimental findings reveal that the performance of ensemble learning is better than individual learners, except for AdaBoost. In addition, random forest has the best performance in terms of five metrics, XGBoost and LightGBM are close challengers. Among five baseline classifiers, logistic regression outperforms the other classifiers over the most of evaluation metrics. Finally, this study also analyzes reasons for the poor performance of some algorithms and give some suggestions on the choice of credit scoring models for financial institutions.</jats:p>

収録刊行物

  • Mathematics

    Mathematics 8 (10), 1756-, 2020-10-13

    MDPI AG

被引用文献 (1)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ