Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Jurnal Sistem Komput...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
Jurnal Sistem Komputer dan Informatika (JSON)
Article . 2023 . Peer-reviewed
License: CC BY
Data sources: Crossref
addClaim

Analisis Perbandingan Kinerja Clustering Data Mining Untuk Normalisasi Dataset

Authors: Siti Emalia Saqila; Intan Putri Ferina; Agus Iskandar;

Analisis Perbandingan Kinerja Clustering Data Mining Untuk Normalisasi Dataset

Abstract

Nowadays, the development and influence of technology in human life is very important, where the role of technology greatly influences the activities carried out by humans. In a company organization, technology is not only used as a process to speed up the processes carried out. The use of such important technology also increases the size or volume of available data information. A dataset is a collection of data obtained in a data warehouse. Data mining is a technique that is part of Knowledge Discovery in Database (KDD). Clustering is a grouping process carried out in data mining. The first problem that is central to the research is that the values obtained from the clustering process are sometimes still not considered optimal. The performance results of the data mining clustering algorithm cannot yet be fully used as a basis for decision making. Comparisons made in clustering data mining are used to assist in the decision making process. In this research, the algorithms that will be used for comparison of performance are the K-Means and K-Medoids algorithms. Another problem that needs special attention is the problem of data quality. The results obtained from the data mining process can be seen from the quality of the data stored or used in the data processing process. Normalization is part of preprocessing data mining which aims to re-reason it based on a new scale. Z-Score is a normalization carried out on data based on statistical functions. The results obtained in the research The role of normalization in the research is very important, this is because using Z-Score normalization can improve the performance of the K-Means and K-Medoids algorithms, this can be seen from the DBI value obtained which is smaller when normalization is carried out compared to before it is carried out normalization, which indicates that performance is better after normalization. In the comparison of algorithms, the K-Medoids algorithm gets better performance, this can be seen from the DBI value obtained at 0.773 at K=9 after normalization. Meanwhile, the K-Means algorithm obtained a value of 0.783 at K=9 after normalization as well

Related Organizations
  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
gold