Climate Classifications: the Value of Unsupervised Clustering

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Jan 2012 English Publisher:Elsevier BVJournal:Procedia Computer Science, volume 9, pages 897-906 (issn: 1877-0509,

Copyright policy )

Authors: Jakob Zscheischler; Miguel D. Mahecha; Stefan Harmeling;

doi: 10.1016/j.procs.2012.04.096

handle: 11858/00-001M-0000-000E-FDED-F , 11858/00-001M-0000-000E-DDDA-E , 11858/00-001M-0000-000E-DDD9-0 , 11858/00-001M-0000-0013-B728-7

Climate Classifications: the Value of Unsupervised Clustering

- Summary
- Subjects
- Metrics

Abstract

AbstractClassifying the land surface according to different climate zones is often a prerequisite for global diagnostic or predictive modelling studies. Classical classiﬁcations such as the prominent K̈oppen–Geiger (KG) approach rely on heuristic decision rules. Although these heuristics may transport some process understanding, such a discretization may appear “arbitrary” from a data oriented perspective. In this contribution we compare the precision of a KG classiﬁcation to an unsupervised classiﬁcation (k-means clustering). Generally speaking, we revisit the problem of “climate classiﬁcation” by investigating the inherent patterns in multiple data streams in a purely data driven way. One question is whether we can reproduce the KG boundaries by exploring different combinations of climate and remotely sensed vegetation variables. In this context we also investigate whether climate and vegetation variables build similar clusters. In terms of statistical performances, k-means clearly outperforms classical climate classiﬁcations. However, a subsequent stability analysis only reveals a meaningful number of clusters if both climate and vegetation data are considered in the analysis. This is a setback for the hope to explain vegetation by means of climate alone. Clearly, classiﬁcation schemes like K̈oppen-Geiger will play an important role in the future. However, future developments in this area need to be assessed based on data driven approaches.

Related Organizations

Max Planck Society
Germany
Max Planck Institute for Intelligent Systems
Germany
Max-Planck-Institute for Biochemistry
Germany
Max Planck Institute of Biochemistry
Germany
Max Planck Institute for Biogeochemistry
Germany

Keywords

PCA, multivariate statistics, k-means, K̈oppen-Geiger climate classiﬁcation, clustering

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	71
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%