Comparative Study of Clustering Algorithms in Text Mining Context

Abdennour Mohamed Jalil; Imad Hafidi; Lamiae Alami; Ensa Khouribga

Author	Abdennour Mohamed Jalil Imad Hafidi Lamiae Alami Ensa Khouribga
Keywords	Data Mining Clustering Algorithms Text Classification
Abstract	The spectacular increasing of Data is due to the appearance of networks and smartphones. Amount 42% of world population using internet [1]; have created a problem related of the processing of the data exchanged, which is rising exponentially and that should be automatically treated. This paper presents a classical process of knowledge discovery databases, in order to treat textual data. This process is divided into three parts: preprocessing, processing and post-processing. In the processing step, we present a comparative study between several clustering algorithms such as KMeans, Global KMeans, Fast Global KMeans, Two Level KMeans and FWKmeans. The comparison between these algorithms is made on real textual data from the web using RSS feeds. Experimental results identified two problems: the first one quality results which remain for algorithms, which rapidly converge. The second problem is due to the execution time that needs to decrease for some algorithms.
Year of Publication	2016
Journal	International Journal of Interactive Multimedia and Artificial Intelligence
Volume	3
Issue	Regular Issue
Number	7
Number of Pages	42-45
Date Published	06/2016
ISSN Number	1989-1660
Citation Key
URL	http://www.ijimai.org/journal/sites/default/files/files/2016/05/ijimai20163_7_6_pdf_27159.pdf
DOI	10.9781/ijimai.2016.376
	DOI Google Scholar BibTeX EndNote X3 XML EndNote 7 XML Endnote tagged Marc RIS
Attachment	ijimai20163_7_6.pdf2.28 MB