Behavior Cloning by a Self-Organizing Decision Tree

Kao-Shing Hwang; Yu-Jen Chen; Tsan-Hui Yang

Found an issue? Give us feedback

https://doi.org/10.1...arrow_drop_down

https://doi.org/10.1109/icitec...

Article . 2007 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.1109/ici...

Article

Data sources: Microsoft Academic Graph

Behavior Cloning by a Self-Organizing Decision Tree

descriptionPublicationkeyboard_double_arrow_right Article 01 Mar 2007Publisher:IEEEJournal:2007 IEEE International Conference on Integration Technology

Authors: Kao-Shing Hwang; Yu-Jen Chen; Tsan-Hui Yang;

doi: 10.1109/icitechnology.2007.4290416

Behavior Cloning by a Self-Organizing Decision Tree

- Summary
- Metrics

Abstract

It is hard to define a state space or the proper reward function in reinforcement learning to make the robot act as expected. In this paper, we demonstrate the expected behavior for a robot Then a RL-based decision tree approach which decides to split according to long-term evaluations, instead of a top-down greedy strategy which finds out the relationship between the input and output from the demonstration data. We use this method to teach a robot for target seeking problem. In order to promote the performance in tackling target seeking problem, we add a Q-learning along with the state space based on RL-based decision tree. The experiment result shows that Q-Iearning can promote the performance quickly. For demonstration, we build a mobile robot powered by an embedded board. The robot can detect the hall of the range in any direction with omni-directional vision system. With such powerful embedded computing capability and the efficient machine vision system, the robot can inherit the learned behavior from a simulator which has learned the empirical behavior and continue to learn with Q-learning to improve the performance of target seeking problem.

Related Organizations

National Chung Cheng University
Taiwan

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now