Detection of chronic kidney disease using machine learning algorithms with least number of predictors
Almasoud, Marwa and Ward, Tomás E.ORCID: 0000-0002-6173-6607
(2019)
Detection of chronic kidney disease using machine learning algorithms with least number of predictors.
International Journal of Soft Computing and Its Applications, 10
(8).
ISSN 2074-8523
Chronic kidney disease (CKD) is one of the most critical health problems due to its increasing prevalence. In this paper, we aim to test the ability of machine learning algorithms for the prediction of chronic kidney disease using the smallest subset of features. Several statistical tests have been done to remove redundant features such as the ANOVA test, the Pearson’s correlation, and the Cramer’s V test. Logistic regression, support vector machines, random forest, and gradient boosting algorithms have been trained and tested using 10-fold cross-validation. We achieve an accuracy of 99.1 according to F1-measure from Gradient Boosting classifier. Also, we found that hemoglobin has higher importance for both random forest and Gradient boosting in detecting CKD. Finally, our results are among the highest compared to previous studies but with less number of features reached so far. Hence, we can detect CKD at only $26.65 by performing three simple tests.
Metadata
Item Type:
Article (Published)
Refereed:
Yes
Uncontrolled Keywords:
Chronic Kidney Disease (CKD); Random Forest (RF); Gradient Boosting (GB); Logistic Regression (LR); Support Vector Machines (SVM); prediction