Título: Loan Default Prediction on Large Imbalanced Data Using Random Forests
Autores: Zhou, Lifeng; School of Mathematics and Statistics, Central South University,China
Wang, Hong; School of Mathematics and Statistics, Central South University,China
Fecha: 2012-10-01
Publicador: TELKOMNIKA: Indonesian journal of electrical engineering
Fuente:
Tipo: info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Tema: loan default prediction; random forests; imbalanced data; parallel computing
Descripción: In this paper, we propose an improved random forest algorithm which allocates weights to decision trees in the forest during tree aggregation for prediction and their weights are easily calculated based on out-of-bag errors in training. We compare the performance of our proposed algorithm and the original one on loan default prediction datasets. We also use these two algorithms to create two kinds of balanced random forests to deal with imbalanced data problem. Experiments results show that our proposed algorithm beats the original random forest in terms of both balanced and overall accuracy metrics. Experiments also show that parallel random forests can greatly improve random forests’ efficiency during the learning process.
Idioma: Inglés