Information theoretic learning with K nearest neighbors : a new clustering algorithm

Name: Information theoretic learning with K nearest neighbors : a new clustering algorithm
Creator: Vikjord, Vidar Vangen
Keywords: VDP::Mathematics and natural science: 400::Information and communication science: 420::Knowledge based systems: 425, VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Kunnskapsbaserte systemer: 425, FYS-3921

descriptionPublicationkeyboard_double_arrow_right Master thesis 01 Jan 2012 English Publisher:Universitetet i Tromsø

Authors: Vikjord, Vidar Vangen;

handle: 10037/4608

Information theoretic learning with K nearest neighbors : a new clustering algorithm

- Summary
- Subjects
- Related research
  (4)
- Metrics

Abstract

The machine learning field based on information theory has received a lot of attention in recent years. Through kernel estimation of the probability density functions, methods developed with information theoretic measures are able to use all the statistical information available in the data, not just a finite number of moments. However, by using kernel estimation, the methods are dependent on choosing a suitable bandwidth parameter and have trouble dealing with data which vary on different scales. In this thesis, the field of information theoretic learning has been explored using k-nearest neighbor estimates for the probability density functions instead. The developed estimators of the information theoretic measures was used in a clustering routine and compared with the traditional kernel estimators.Performing clustering on a range of datasets and comparing the performance, the new method proved to provide superior results without the need of tuning any parameters. The performance difference was found to be especially large when clustering datasets where groups were on different scales.

Related Organizations

The Arctic University of Norway
Norway

Keywords

VDP::Mathematics and natural science: 400::Information and communication science: 420::Knowledge based systems: 425, VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Kunnskapsbaserte systemer: 425, FYS-3921

4 Research products, page 1 of 1

Towards the Semantic Desktop
2010IsAmongTopNSimilarDocuments
IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks
2012IsAmongTopNSimilarDocuments
A Scatter-Based Prototype Framework and Multi-Class Extension of Support Vector Machines
2012IsAmongTopNSimilarDocuments
Mean shift spectral clustering using kernel entropy component analysis
2012IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Related to Research communities

UArctic

Information theoretic learning with K nearest neighbors : a new clustering algorithm

Information theoretic learning with K nearest neighbors : a new clustering algorithm

4 Research products, page 1 of 1

Towards the Semantic Desktop

IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks

A Scatter-Based Prototype Framework and Multi-Class Extension of Support Vector Machines

Mean shift spectral clustering using kernel entropy component analysis