Performance of a K-Means Algorithm Driven by Careful Seeding

Name: Performance of a K-Means Algorithm Driven by Careful Seeding
Keywords: Clustering Accuracy Indexes, K-Means Clustering, Greedy K-Means++, Benchmark and Real-World Datasets, Seeding Procedure, Execution Performance., K-Means clustering, Seeding procedure, Greedy K-Means++, Clustering accuracy indexes, Java parallel streams, Benchmark and real-world datasets, Execution performance, Java Parallel Streams

Libero, Nigro; Franco, Cicirelli

Found an issue? Give us feedback

downloadFull-Text

IRIS Cnrarrow_drop_down

IRIS Cnr

Conference object . 2023

Full-Text: https://iris.cnr.it/request-item?handle=20.500.14243/461867&bitstreamId=a29ad0d5-159d-4fd7-a558-c6f66f0f6ba2

Data sources: IRIS Cnr

CNR ExploRA

Conference object . 2023

Data sources: CNR ExploRA

Archivio Istituzionale dell'Università della Calabria

Conference object . 2023

Data sources: Archivio Istituzionale dell'Università della Calabria

https://doi.org/10.5220/001204...

Article . 2023 . Peer-reviewed

Data sources: Crossref

Performance of a K-Means Algorithm Driven by Careful Seeding

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Jan 2023Publisher:SCITEPRESS - Science and Technology PublicationsJournal:Proceedings of the 13th International Conference on Simulation and Modeling Methodologies, Technologies and Applications

Authors: Libero, Nigro; Franco, Cicirelli;

doi: 10.5220/0012045000003546

handle: 20.500.14243/461867 , 20.500.11770/354142

Performance of a K-Means Algorithm Driven by Careful Seeding

- Summary
- Subjects
- Metrics

Abstract

This paper proposes a variation of the K-Means clustering algorithm, named Population-Based K-Means (PBK-MEANS), which founds its behaviour on careful seeding. The new K-Means algorithm rests on a greedy version of the K-Means++ seeding procedure (g_kmeans++), which proves effective in the search for an accurate clustering solution. PB-K-MEANS first builds a population of candidate solutions by independent runs of K-Means with g_kmeans++. Then the reservoir is used for recombining the stored solutions by Repeated K-Means toward the attainment of a final solution which minimizes the distortion index. PB-KMEANS is currently implemented in Java through parallel streams and lambda expressions. The paper first recalls basic concepts of clustering and of K-Means together with the role of the seeding procedure, then it goes on by describing basic design and implementation issues of PB-K-MEANS. After that, simulation experiments carried out both on synthetic and real-world datasets are reported, confirming good execution performance and careful clustering.

Related Organizations

University of Calabria
Italy
National Research Council
Italy
National Research Council
Sri Lanka
Institute for high performance computing and networking
Italy

Keywords

Clustering Accuracy Indexes, K-Means Clustering, Greedy K-Means++, Benchmark and Real-World Datasets, Seeding Procedure, Execution Performance., K-Means clustering, Seeding procedure, Greedy K-Means++, Clustering accuracy indexes, Java parallel streams, Benchmark and real-world datasets, Execution performance, Java Parallel Streams

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

4

Top 10%

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now