Título: Parallel Implementation of Classification Algorithms Based on Cloud Computing Environment
Autores: Zhou, Lijuan; Information Engineering College
Wang, Hui; Information Engineering College
Wang, Wenbo; Information Engineering College
Fecha: 2012-09-01
Publicador: TELKOMNIKA: Indonesian journal of electrical engineering
Fuente:
Tipo: info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Tema: No aplica
Descripción: As an important task of data mining, Classification has been received considerable attention in many applications, such as information retrieval, web searching, etc. The enlarging volumes of information emerging by the progress of technology and the growing individual needs of data mining, makes classifying of very large scale of data a challenging task. In order to deal with the problem, many researchers try to design efficient parallel classification algorithms. This paper introduces the classification algorithms and cloud computing briefly, based on it analyses the bad points of the present parallel classification algorithms, then addresses a new model of parallel classifying algorithms. And it mainly introduces a parallel Naïve Bayes classification algorithm based on MapReduce, which is a simple yet powerful parallel programming technique. The experimental results demonstrate that the proposed algorithm improves the original algorithm performance, and it can process large datasets efficiently on commodity hardware.
Idioma: Inglés