Kernel Density Based Spatial Clustering of Applications with Noise

Name: Kernel Density Based Spatial Clustering of Applications with Noise
Keywords: Kernel, Technology, T, Electronic computers. Computer science, QA75.5-76.95, DBSCAN, Clustering

Rohan Kalpavruksha; Roshan Kalpavruksha; Teryn Cha; Sung-Hyuk Cha

Found an issue? Give us feedback

Proceedings of the I...arrow_drop_down

Proceedings of the International Florida Artificial Intelligence Research Society Conference

Article . 2025 . Peer-reviewed

License: CC BY NC

Data sources: Crossref

Proceedings of the International Florida Artificial Intelligence Research Society Conference

Article . 2025

Data sources: DOAJ

Kernel Density Based Spatial Clustering of Applications with Noise

descriptionPublicationkeyboard_double_arrow_right Article 14 May 2025Publisher:University of Florida George A Smathers LibrariesJournal:The International FLAIRS Conference Proceedings, volume 38 (issn: 2334-0754, eissn: 2334-0762,

Copyright policy )

Authors: Rohan Kalpavruksha; Roshan Kalpavruksha; Teryn Cha; Sung-Hyuk Cha;

doi: 10.32473/flairs.38.1.138998

Kernel Density Based Spatial Clustering of Applications with Noise

- Summary
- Subjects
- Metrics

Abstract

Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is a widely used clustering algorithm renowned for its ability to identify clusters of arbitrary shapes and detect noise. However, its reliance on fixed parameters, such as the minimum number of points (MinPts) and the epsilon radius (epsilon), makes it sensitive to variations in sample density. This paper reinterprets DBSCAN as a specific case of kernel density estimation (KDE)-based clustering, where the kernel shape corresponds to a hyper-rectangular pillar or cylindrical kernel, depending on the distance metric. Building on this foundation, we introduce a flexible framework incorporating various kernel functions, including uniform, conical, Epanechnikov, cosine, exponential, and Gaussian kernels, to estimate the density distribution of data points. The threshold values are selected to identify high-density regions by retaining the top 90% of points, while excluding low-density points as noise, thereby enhancing clustering precision. Clusters are adaptively formed by leveraging points within the kernel range, thereby increasing the algorithm's robustness to noise and its adaptability to irregular density patterns. Empirical results demonstrate that the proposed approach outperforms traditional DBSCAN, as evidenced by lower Davies-Bouldin indices and higher silhouette scores. This study highlights the potential of density-driven clustering for practical applications, including social media sentiment analysis, customer segmentation in e-commerce, and medical data analysis, particularly in scenarios involving noise-prone or unevenly distributed datasets.

Related Organizations

Pace University
United States

Keywords

Kernel, Technology, T, Electronic computers. Computer science, QA75.5-76.95, DBSCAN, Clustering

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

gold