
Abstract We study properties of Linear Genetic Programming (LGP) through several regression and classification benchmarks. In each problem, we decompose the results into bias and variance components, and explore the effect of varying certain key parameters on the overall error and its decomposed contributions. These parameters are the maximum program size, the initial population, and the function set used. We confirm and quantify several insights into the practical usage of GP, most notably that (a) the variance between runs is primarily due to initialization rather than the selection of training samples, (b) parameters can be reasonably optimized to obtain gains in efficacy, and (c) functions detrimental to evolvability are easily eliminated, while functions well-suited to the problem can greatly improve performance—therefore, larger and more diverse function sets are always preferable.
computational learning theory, 68q32, Analysis of variance and covariance (ANOVA), Learning and adaptive systems in artificial intelligence, Computational learning theory, 68w40, non-parametric inference, 68t05, bias-variance decomposition, analysis of algorithms, classification, evolutionary computation, 62g08, QA1-939, learning and adaptive systems, Analysis of algorithms, genetic programming, regression, Nonparametric regression and quantile regression, 62j10, Mathematics
computational learning theory, 68q32, Analysis of variance and covariance (ANOVA), Learning and adaptive systems in artificial intelligence, Computational learning theory, 68w40, non-parametric inference, 68t05, bias-variance decomposition, analysis of algorithms, classification, evolutionary computation, 62g08, QA1-939, learning and adaptive systems, Analysis of algorithms, genetic programming, regression, Nonparametric regression and quantile regression, 62j10, Mathematics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 7 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
