Title: A FUZZY CHARACTERISTIC SELECTION AND EXTRACTION FOR TEXT CATEGORIZATION
Authors: Koorapati, Jhansi; M.Tech Student, Dept of CSE, CMR Technical Campus, Kandlakoya (V) Medchal (M), A.P, India
Lakshmi, G.Vijaya; M.Tech Student, Dept of CSE, CMR Technical Campus, Kandlakoya (V) Medchal (M), A.P, India
Nagaraju, R.; Asst. Professor, Dept of CSE, CMR Technical Campus, Kandlakoya (V) Medchal (M), A.P, India
Date: 2013-06-02
Publisher: International journal of computer and electronics research
Source:
Type: Peer-reviewed Article
Subject: Dimensionality; Clustering; Feature Selection; Text Classification; High Dimensional; Data Sets
Description: In text classification, the dimensionality of the feature vector is usually very large, and such high dimensionality can be a severe obstacle for classification algorithms. Feature clustering is a powerful method for reducing the dimensionality of feature vectors for text classification. This paper proposes a fuzzy similarity-based self-constructing algorithm for feature clustering. The words in the feature vector of a document set are grouped into clusters based on a similarity test, so that words that are similar to each other fall into the same cluster. Each cluster is characterized by a membership function with a statistical mean and deviation. Once all the words have been fed in, a desired number of clusters has been formed automatically, and one feature is extracted for each cluster. The extracted feature corresponding to a cluster is a weighted combination of the words contained in that cluster. With this algorithm, the resulting membership functions closely match and properly describe the real distribution of the training data. In addition, the user need not specify the number of extracted features in advance, so trial and error in determining the appropriate number of extracted features can be avoided. Our simulation results show that our technique can run faster and obtain better extracted features than other methods.
Language: English
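
Illustrative sketch (not part of the original record): the description outlines a self-constructing procedure in which each word joins the most similar existing cluster or starts a new one, and each cluster then yields one extracted feature. The Python sketch below shows one way such a procedure might look. The description does not give the formulas, so the following are assumptions made only for illustration: each word is represented by a numeric pattern vector (for example, its distribution over classes), the membership function is a product of per-dimension Gaussian-like terms, and the names membership, self_constructing_clusters, extract_features, threshold and init_dev are hypothetical.

```python
import math


def membership(x, mean, dev):
    # Assumed form: product of per-dimension Gaussian-like terms.
    # The value is 1.0 at the cluster mean and decays with distance.
    return math.prod(
        math.exp(-((xi - mi) / di) ** 2) for xi, mi, di in zip(x, mean, dev)
    )


def self_constructing_clusters(patterns, threshold=0.5, init_dev=0.25):
    # Feed word patterns in one at a time. A word joins the best-matching
    # cluster if its membership reaches the threshold; otherwise it starts
    # a new cluster. The number of clusters is not fixed in advance.
    clusters = []  # each cluster: {"members": [...], "mean": [...], "dev": [...]}
    for idx, x in enumerate(patterns):
        scores = [membership(x, c["mean"], c["dev"]) for c in clusters]
        best = max(range(len(scores)), key=scores.__getitem__, default=None)
        if best is None or scores[best] < threshold:
            clusters.append(
                {"members": [idx], "mean": list(x), "dev": [init_dev] * len(x)}
            )
            continue
        c = clusters[best]
        c["members"].append(idx)
        pts = [patterns[i] for i in c["members"]]
        n = len(pts)
        dims = range(len(x))
        # Recompute the mean and deviation from the cluster's members.
        # init_dev is added so the deviation never collapses to zero.
        c["mean"] = [sum(p[q] for p in pts) / n for q in dims]
        c["dev"] = [
            init_dev + math.sqrt(sum((p[q] - c["mean"][q]) ** 2 for p in pts) / n)
            for q in dims
        ]
    return clusters


def extract_features(doc_counts, patterns, clusters):
    # One extracted feature per cluster: a membership-weighted sum of the
    # document's counts for the words contained in that cluster.
    return [
        sum(
            membership(patterns[i], c["mean"], c["dev"]) * doc_counts[i]
            for i in c["members"]
        )
        for c in clusters
    ]


# Hypothetical data: four words described by two-class pattern vectors.
patterns = [[0.9, 0.1], [0.88, 0.12], [0.1, 0.9], [0.12, 0.88]]
clusters = self_constructing_clusters(patterns)
features = extract_features([2, 1, 0, 3], patterns, clusters)
print(len(clusters), features)
```

In this sketch, the threshold controls how readily new clusters are created: a higher value gives more, tighter clusters, and a lower value gives fewer, broader ones. A document with one count per word is reduced to one value per cluster.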

Similar articles:

A hybrid k-Mean-GRASP for partition based Clustering of two-dimensional data space as an application of p-median problem by Nadella, Sunil; Associate Professor, P.G. Dept of Computer Science, Ideal College of Arts & Sciences, Kakinada, M V S V, Kiranmai; Lecturer, Department of CSE, University College of Engineering, JNTU Kakinada, Gugulotu, Narasimha; Department of CSE JNTUH College of Engineering, Nachupally, Kondagattu, Karimnagar
Architecture of Mobile application, Security issues and Services involved in Mobile Cloud Computing Environment by Saini, Swarnpreet Singh; Dept. of Computer Science Engineering CT Institute of Engineering and Management Technology, Jalandhar, Bagga, Ritu; Dept. of Computer Science Engineering CT Institute of Engineering and Management Technology, Jalandhar, Singh, Devinder; Dept. of Computer Science Engineering CT Institute of Engineering and Management Technology, Jalandhar, Jangwal, Tarun; Dept. of Computer Science Engineering CT Institute of Engineering and Management Technology, Jalandhar
A Novel approach of Hybrid Method of Hiding the Text Information Using Stegnography by R, Thamaraiselvan; Assistant Professor Dept. of Computer Applications M.G.R.College Dr. MGR Nagar Hosur - 635109 TN, India, Saradha, A.; Professor and Head, Department of Computer Science and Engineering, Institute of Road and Transport Technology, Erode, TN, India
RSA Algorithm Implementation for Ciphering Medical Imaging by Ali, Samoud; Signal processing Laboratory - Science Faculty of Tunis, 1060 Tunis, Adnen, Cherif; Signal processing Laboratory - Science Faculty of Tunis, 1060 Tunis
A STUDY ON KNOWLEDGE ACQUISITION APPROACH FOR COMPOSITE AEROSPACE COMPONENT DESIGN by Sivaraman, G; Assistant Professor & Head Dept. of Computer Applications M.G.R.College Dr. MGR Nagar Hosur - 635109 TN
Corpus based Emotion Extraction to implement prosody feature in Speech Synthesis Systems by Chandak, Manoj B; DEPTT. OF COMPUTER SCIENCE AND ENGG S.R.K.N.E.C NAGPUR UNIVERSITY, Bhutekar, Swati
A NEW DATA HIDING METHOD USING PIXEL POSITION AND LOGICAL AND OPERATION by Saini, Ravi; C.M.R.A., GP Sanghi, Rohtak, Haryana, Yadav, Rajkumar; U.I.E.T, Maharshi Dayanand University, Rohtak
A New Focus on Distance Learning for Physically Impaired Students: A Multi-agent Oriented Approach by Pujari, Shiladitya; University Institute of Technology, Burdwan University, Mondal, Subrata
Productivity Inference with Dynamic Bayesian Models in Software Development Projects by Naman, Abou Bakar; Department of Computer Science & Information Technology Sarhad University, Lali, M.Ikram.; University of education Lahore, Attock campus