An Attempt at Improving Density-based Clustering Algorithms

Clustering is an unsupervised analytical technique for processing data that works by grouping elements of a set in order to form clusters of similar items. This task lies at the base level of many other tasks including machine vision and artificial intelligence. Big data sets present challenges for clustering due to the size and complexity of the data to be processed. Previous work in this domain resulted in an algorithm called Fast Density-Grid Clustering, which is designed to create a grid structure on the data and then merging cells based on local density. In this paper, we focus on an ongoing modification to this algorithm that could also be used for other density-based clustering algorithms, based on adaptive grids that have an irregular spacing. Testing shows that while the initial results for the first stage of implementation resulted in an average 29.728% loss in accuracy with no significant speed increase, there is a lot of further room for experimentation and development for this approach.

Related Organizations

Kennesaw State University
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	7
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%