Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Liriasarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
Lirias
Conference object . 2008
Data sources: Lirias
versions View all 1 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Using decision trees as the answer network in temporal-difference networks

Authors: Antanas, Laura; Driessens, Kurt; Croonenborghs, Tom; Ramon, Jan;

Using decision trees as the answer network in temporal-difference networks

Abstract

Temporal difference networks (or TD-Nets) offer a framework for predictive state representations. TD-Nets break up into two parts: the question network and the answer network. The question network defines which questions about future observations are of importance, while the answer network provides a way to update the answers to those questions as the environment changes. Currently, TD-Nets use logistic regression functions to represent the answer networks. We propose the use of probability trees in their stead. Trees offer a different but powerful way of generalisation and using them may be beneficial in a number of applications. Moreover, we believe this aids in a better understanding of the strengths and weaknesses of TD-Nets and represents an important first step towards the application of temporal difference networks in environments with more extensive, i.e. complex and numerous, observations than those currently employed. We compare the learning behavior of TD-Nets using logistic regression and probability trees using an array of experiments in two simple grid worlds and a ring world.

ispartof: European Conference on Artificial Intelligence location:Patras, Greece date:21 Jul - 25 Jul 2008

ispartof: Proceedings of the 18th European Conference on Artificial Intelligence vol:178 pages:847-848

ispartof: pages:847-848

acceptance rate = 23%

status: published

Country
Belgium
Related Organizations
Keywords

temporal-difference networks, probability trees

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green