Título: A PSO-based clustering approach assisted by initial clustering information
Autores: Velázquez, Carlos
Cagnina, Leticia
Errecalde, Marcelo Luis
Fecha: 2012-11-06
2012-10
2012-10
Publicador: Unversidad Nacional de La Plata
Fuente:

Tipo: Objeto de conferencia
Objeto de conferencia
Tema: Short-Text Clustering
Bio-Inspired Methods
PSO-based Clustering
Hybrid Methods
Expectation-Maximization
Initialization Approaches
Clustering
Data mining
Ciencias Informáticas
base de datos
Descripción: Clustering of short texts is an important research area because of its applicability in information retrieval and text mining. To this end was proposed CLUDIPSO, a discrete Particle Swarm Optimization algorithm to cluster short texts. Initial results showed that CLUDIPSO has performed well in small collections of short texts. However, later works showed some drawbacks when dealing with larger collections. In this paper we present a hybridization of CLUDIPSO to overcome these drawbacks, by providing information in the initial cycles of the algorithm to avoid a random search and thus speed up the convergence process. This is achieved by using a pre-clustering obtained with the Expectation-Maximization method which is included in the initial population of the algorithm. The results obtained with the hybrid version show a significant improvement over those obtained with the original version.
Eje: Workshop Bases de datos y minería de datos (WBDDM)
Idioma: Inglés