Learning Relational Dynamics of Stochastic Domains for Planning

Name: Learning Relational Dynamics of Stochastic Domains for Planning
Keywords: [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Planning operators, 330, :Informàtica::Automàtica i control [Àrees temàtiques de la UPC], :Optimisation [Classificació INSPEC], Scheduling, [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Classificació INSPEC::Optimisation::Mathematical programming::Stochastic programming, Classificació INSPEC::Optimisation, Stochastic domains

Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme; Ribeiro, Tony; Inoue, Katsumi

Found an issue? Give us feedback

downloadFull-Text

Recolector de Cienci...arrow_drop_down

Recolector de Ciencia Abierta, RECOLECTA

Conference object . 2016 . Peer-reviewed

Full-Text: http://icaps16.icaps-conference.org/

Data sources: Recolector de Ciencia Abierta, RECOLECTA

UPCommons. Portal del coneixement obert de la UPC

Conference object . 2016 . Peer-reviewed

License: CC BY NC ND

Full-Text: https://upcommons.upc.edu/bitstreams/d2461523-8a55-4e29-b448-8758d5951c8a/download

Data sources: UPCommons. Portal del coneixement obert de la UPC

Proceedings of the International Conference on Automated Planning and Scheduling

Article . 2016 . Peer-reviewed

Data sources: Crossref

Recolector de Ciencia Abierta, RECOLECTA

Conference object . 2016 . Peer-reviewed

License: CC BY NC ND

Data sources: Recolector de Ciencia Abierta, RECOLECTA

Recolector de Ciencia Abierta, RECOLECTA

Conference object . 2016 . Peer-reviewed

Data sources: Recolector de Ciencia Abierta, RECOLECTA

Recolector de Ciencia Abierta, RECOLECTA

Conference object . 2016 . Peer-reviewed

License: CC BY NC ND

Data sources: Recolector de Ciencia Abierta, RECOLECTA

INRIA2

Conference object . 2016

Data sources: INRIA2

INRIA a CCSD electronic archive server

Conference object . 2016

Data sources: INRIA a CCSD electronic archive server

DBLP

Conference object

Data sources: DBLP

Learning Relational Dynamics of Stochastic Domains for Planning

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 30 Mar 2016 Spain, Japan, France Publisher:Association for the Advancement of Artificial Intelligence (AAAI)Journal:Proceedings of the International Conference on Automated Planning and Scheduling, volume 26, pages 235-243 (issn: 2334-0835, eissn: 2334-0843,

Copyright policy )

Authors: Martínez Martínez, David; Alenyà Ribas, Guillem; Torras, Carme; Ribeiro, Tony; Inoue, Katsumi;

doi: 10.1609/icaps.v26i1.13746

handle: 10261/133020 , 2117/103612

Learning Relational Dynamics of Stochastic Domains for Planning

- Summary
- Subjects
- Related research
  (19)
- Metrics

Abstract

Probabilistic planners are very flexible tools that can provide good solutions for difficult tasks. However, they rely on a model of the domain, which may be costly to either hand code or automatically learn for complex tasks. We propose a new learning approach that (a) requires only a set of state transitions to learn the model; (b) can cope with uncertainty in the effects; (c) uses a relational representation to generalize over different objects; and (d) in addition to action effects, it can also learn exogenous effects that are not related to any action, e.g., moving objects, endogenous growth and natural development. The proposed learning approach combines a multi-valued variant of inductive logic programming for the generation of candidate models, with an optimization method to select the best set of planning operators to model a problem. Finally, experimental validation is provided that shows improvements over previous work.

Countries

Spain, Japan, France

Related Organizations

French Institute for Research in Computer Science and Automation
France
Institute of Robotics and Industrial Informatics
Spain
Universitat Polite`cnica de Catalunya
Spain
University of Nantes
France
Institute of Science Tokyo
Japan

View all View all

Keywords

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Planning operators, 330, :Informàtica::Automàtica i control [Àrees temàtiques de la UPC], :Optimisation [Classificació INSPEC], Scheduling, [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Classificació INSPEC::Optimisation::Mathematical programming::Stochastic programming, Classificació INSPEC::Optimisation, Stochastic domains, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], [INFO] Computer Science [cs], :Optimisation::Mathematical programming::Stochastic programming [Classificació INSPEC], Endogenous growth, State transitions, Experimental validations, Àrees temàtiques de la UPC::Informàtica::Automàtica i control, Optimization method, Inductive logic programming (ILP), Relational representations, Stochastic Systems, Learning approach

19 Research products, page 1 of 2

Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion
2016IsAmongTopNSimilarDocuments
Multi-modal joint embedding for fashion product retrieval
2017IsAmongTopNSimilarDocuments
Robot motion adaptation through user intervention and reinforcement learning
2018IsAmongTopNSimilarDocuments
Active garment recognition and target grasping point detection using deep learning
2018IsAmongTopNSimilarDocuments
Structured Prediction with Output Embeddings for Semantic Image Annotation
2016IsAmongTopNSimilarDocuments
Personalization Framework for Adaptive Robotic Feeding Assistance
2016IsAmongTopNSimilarDocuments
Mode-shape interpretation: Re-thinking modal space for recovering deformable shapes
2016IsAmongTopNSimilarDocuments
Accurate and Linear Time Pose Estimation from Points and Lines
2016IsAmongTopNSimilarDocuments
Recovering Pose and 3D Deformable Shape from Multi-instance Image Ensembles
2017IsAmongTopNSimilarDocuments
Modal Space: A Physics-Based Model for Sequential Estimation of Time-Varying Shape from Monocular Video
2016IsAmongTopNSimilarDocuments

chevron_left
1
2
chevron_right

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	10
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

10

Top 10%

Average

Green

gold

Related to Research communities

INRIA