Bias-variance decomposition in Genetic Programming

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2016 United Kingdom English Publisher:Walter de Gruyter GmbHJournal:Open Mathematics, volume 14, pages 62-80 (eissn: 2391-5455,

Copyright policy )

Authors: Kowaliw, T; Doursat, R;

doi: 10.1515/math-2016-0005

Bias-variance decomposition in Genetic Programming

- Summary
- Subjects
- Metrics

Abstract

Abstract We study properties of Linear Genetic Programming (LGP) through several regression and classification benchmarks. In each problem, we decompose the results into bias and variance components, and explore the effect of varying certain key parameters on the overall error and its decomposed contributions. These parameters are the maximum program size, the initial population, and the function set used. We confirm and quantify several insights into the practical usage of GP, most notably that (a) the variance between runs is primarily due to initialization rather than the selection of training samples, (b) parameters can be reasonably optimized to obtain gains in efficacy, and (c) functions detrimental to evolvability are easily eliminated, while functions well-suited to the problem can greatly improve performance—therefore, larger and more diverse function sets are always preferable.

Country

United Kingdom

Related Organizations

Institut des Systèmes Complexes - Paris Ile de France
France
French National Centre for Scientific Research
France
Centre national de la recherche scientifique
France
Manchester Metropolitan University
United Kingdom

Keywords

computational learning theory, 68q32, Analysis of variance and covariance (ANOVA), Learning and adaptive systems in artificial intelligence, Computational learning theory, 68w40, non-parametric inference, 68t05, bias-variance decomposition, analysis of algorithms, classification, evolutionary computation, 62g08, QA1-939, learning and adaptive systems, Analysis of algorithms, genetic programming, regression, Nonparametric regression and quantile regression, 62j10, Mathematics

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	7
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

7

Top 10%

Average

gold

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all