Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies

Type: Article, Preprint, Conference object
Date: 01 Jan 2024
Embargo end date: 01 Jan 2024
Publisher: Ciaco - i6doc.com
Journal: ESANN 2024 proceedings
Funded by: EC | MARS

Authors: Gross, Dennis; Spieker, Helge

doi: 10.14428/esann/2024.es2024-71, 10.48550/arxiv.2409.10218, 10.5281/zenodo.18487797, 10.5281/zenodo.18487796

arXiv: 2409.10218


Abstract

Pruning neural networks (NNs) can streamline them but risks removing parameters that are vital to the safety of reinforcement learning (RL) policies. We introduce VERINTER, an interpretable RL method that combines NN pruning with model checking to ensure interpretable RL safety. VERINTER exactly quantifies the effects of pruning, and the impact of neural connections on complex safety properties, by analyzing changes in safety measurements. The method maintains safety in pruned RL policies and enhances understanding of their safety dynamics, and it has proven effective in multiple RL settings.
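
The idea in the abstract, pruning a connection and then measuring how a safety property changes, can be illustrated with a minimal sketch. This is not the authors' VERINTER implementation: the four-state MDP, the toy linear policy, and every name below (TRANSITIONS, WEIGHTS, act, prune, unsafe_probability) are assumptions made up for illustration, and a simple fixed-point iteration stands in for a real probabilistic model checker.

```python
import copy

# Hypothetical 4-state MDP (an assumption, not from the paper):
# 0 = start, 1 = mid, 2 = goal (absorbing), 3 = unsafe (absorbing).
GOAL, UNSAFE = 2, 3

# TRANSITIONS[state][action] maps next_state -> probability.
# Action 0 is "cautious", action 1 is "risky".
TRANSITIONS = {
    0: {0: {1: 1.0}, 1: {GOAL: 0.5, UNSAFE: 0.5}},
    1: {0: {GOAL: 0.9, 0: 0.1}, 1: {GOAL: 0.8, UNSAFE: 0.2}},
}

# Toy linear policy on one-hot state inputs: WEIGHTS[state][action]
# is the connection from input `state` to output `action`.
WEIGHTS = [[1.0, 0.5], [0.7, 0.6]]


def act(weights, state):
    """Greedy action: index of the highest output score."""
    scores = weights[state]
    return max(range(len(scores)), key=lambda a: scores[a])


def prune(weights, state, action):
    """Return a copy of the weights with one connection set to zero."""
    pruned = copy.deepcopy(weights)
    pruned[state][action] = 0.0
    return pruned


def unsafe_probability(weights, start=0, sweeps=1000):
    """Probability of eventually reaching UNSAFE under the policy.

    The policy turns the MDP into a Markov chain; iterating
    x[s] = sum_t P(s, t) * x[t] from x = 0 converges to the
    reachability probability. A model checker would do this step.
    """
    x = {0: 0.0, 1: 0.0, GOAL: 0.0, UNSAFE: 1.0}
    for _ in range(sweeps):
        for s in TRANSITIONS:
            successors = TRANSITIONS[s][act(weights, s)]
            x[s] = sum(p * x[t] for t, p in successors.items())
    return x[start]


baseline = unsafe_probability(WEIGHTS)

# Change in the safety measurement caused by pruning each connection.
impact = {
    (s, a): unsafe_probability(prune(WEIGHTS, s, a)) - baseline
    for s in range(2)
    for a in range(2)
}

for connection, delta in sorted(impact.items()):
    print(connection, round(delta, 6))
```

In this toy setting, connections whose removal leaves the unsafe-state probability unchanged are candidates for pruning, while a nonzero change marks a connection as relevant to the safety property.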

Related Organizations

Simula Research Laboratory
Norway

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)

Impact by BIP!

selected citations: 0
popularity: Average
influence: Average
impulse: Average

