Safety-Critical Learning of Robot Control With Temporal Logic Specifications

Publication: Article, Preprint, 01 Aug 2025

Embargo end date: 01 Jan 2021

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Journal: IEEE Transactions on Automatic Control, volume 70, pages 5553-5560 (ISSN: 0018-9286, eISSN: 2334-3303)

Funded by: FCT | D4

Authors: Mingyu Cai; Cristian-Ioan Vasile

doi: 10.1109/tac.2025.3550850 , 10.48550/arxiv.2109.02791

arXiv: 2109.02791


Abstract

Reinforcement learning (RL) is a promising approach, but its success in real-world applications remains limited, because ensuring safe exploration and facilitating adequate exploitation is challenging when controlling robotic systems with unknown models and measurement uncertainties. The learning problem becomes even more difficult for complex tasks over continuous state-action spaces. In this paper, we propose a learning-based robotic control framework consisting of several aspects: (1) we leverage Linear Temporal Logic (LTL) to express complex tasks over infinite horizons, which are translated to a novel automaton structure; (2) we detail an innovative reward scheme for LTL satisfaction with a probabilistic guarantee. Then, by applying a reward shaping technique, we develop a modular policy-gradient architecture that exploits the automaton structure to decompose overall tasks and enhance the performance of learned controllers; (3) by incorporating Gaussian Processes (GPs) to estimate the uncertain system dynamics, we synthesize model-based safe exploration during the learning process using Exponential Control Barrier Functions (ECBFs), which generalize to systems with high relative degree; (4) to further improve the efficiency of exploration, we utilize the properties of LTL automata and ECBFs to propose a safe guiding process. Finally, we demonstrate the effectiveness of the framework in several robotic environments. We show that an ECBF-based modular deep RL algorithm achieves near-perfect success rates and guards safety with high probability during training.

Under Review. arXiv admin note: text overlap with arXiv:2102.12855
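
To make the ECBF idea in item (3) concrete, here is a minimal sketch of a safety filter for a system of relative degree two. It is not the authors' implementation: it assumes a known 1-D double integrator (x' = v, v' = u) instead of GP-estimated dynamics, a single position limit `x_max`, and a constant nominal input `u_nom` standing in for the RL policy. The function names, gains, and simulation settings are all hypothetical. With one scalar input and one constraint, the usual ECBF quadratic program reduces to the closed-form `min` used below.

```python
def ecbf_gains(p1, p2):
    """Gains (k1, k2) that place the roots of s^2 + k2*s + k1 at -p1 and -p2."""
    return p1 * p2, p1 + p2


def ecbf_filter(u_nom, x, v, x_max, k1, k2):
    """Return the input closest to u_nom that satisfies the ECBF condition.

    Barrier: h(x) = x_max - x, so h' = -v and h'' = -u (relative degree 2).
    ECBF condition: h'' + k2*h' + k1*h >= 0  <=>  u <= k1*(x_max - x) - k2*v.
    """
    u_bound = k1 * (x_max - x) - k2 * v
    return min(u_nom, u_bound)


def simulate(u_nom=1.0, x_max=1.0, p1=1.0, p2=2.0, dt=0.01, steps=1000,
             use_filter=True):
    """Explicit-Euler rollout from rest at x = 0; returns visited positions."""
    k1, k2 = ecbf_gains(p1, p2)
    x, v = 0.0, 0.0
    xs = [x]
    for _ in range(steps):
        if use_filter:
            u = ecbf_filter(u_nom, x, v, x_max, k1, k2)
        else:
            u = u_nom  # unfiltered nominal input (assumed stand-in for a policy)
        # Both updates use the state from the start of the step.
        x, v = x + dt * v, v + dt * u
        xs.append(x)
    return xs
```

Calling `simulate(use_filter=True)` keeps the position at or below `x_max` under these assumptions, while `simulate(use_filter=False)` lets the constant nominal input drive the state past the limit. The filter only overrides the nominal input when the bound becomes active, which is the sense in which exploration stays minimally restricted.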

Related Organizations

Lehigh University, United States
University of California, Riverside, United States

Keywords

FOS: Computer and information sciences, Computer Science - Robotics, Computer Science - Machine Learning, Formal Languages and Automata Theory (cs.FL), Computer Science - Formal Languages and Automata Theory, Robotics (cs.RO), Machine Learning (cs.LG)


Impact by BIP!

Selected citations: 1
Popularity: Average
Influence: Average
Impulse: Average


Fields of Science

engineering and technology

industrial biotechnology

