When Partly Missing Data Matters in Software Effort Development Prediction
-
- Twala Bhekisipho
- Department of Electrical and Electronic Engineering Science, University of Johannesburg
Search this article
Description
<p>The major objective of the paper is to investigate a new probabilistic supervised learning approach that incorporates “missingness” into a decision tree classifier splitting criterion at each particular attribute node in terms of software effort development predictive accuracy. The proposed approach is compared empirically with ten supervised learning methods (classifiers) that have mechanisms for dealing with missing values. 10 industrial datasets are utilized for this task. Overall, missing incorporated in attributes 3 is the top performing strategy, followed by C4.5, missing incorporated in attributes, missing incorporated in attributes 2, missing incorporated in attributes, linear discriminant analysis and so on. Classification and regression trees and C4.5 performed well in data with high correlations among attributes while k-nearest neighbour and support vector machines performed well in data with higher complexity (limited number of instances). The worst performing method is repeated incremental pruning to produce error reduction.</p>
Journal
-
- Journal of Advanced Computational Intelligence and Intelligent Informatics
-
Journal of Advanced Computational Intelligence and Intelligent Informatics 21 (5), 803-812, 2017-09-20
Fuji Technology Press Ltd.
- Tweet
Details 詳細情報について
-
- CRID
- 1390282763068012288
-
- NII Article ID
- 130007520206
-
- NII Book ID
- AA12042502
-
- ISSN
- 18838014
- 13430130
-
- NDL BIB ID
- 028510751
-
- Text Lang
- en
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed