Seminar第2216讲 An Iterative Scheme of Safe Reinforcement Learning for Nonlinear Systems via Barrier Certificate Generation

创建时间:  2021/11/25  谭福平   浏览次数:   返回

报告题目 (Title):An Iterative Scheme of Safe Reinforcement Learning for Nonlinear Systems via Barrier Certificate Generation

报告人 (Speaker):杨争峰 教授(华东师范大学大学)

报告时间 (Time):2021年11月25日(周四) 19:00

报告地点 (Place):腾讯会议号: 427648902,密码:123456

邀请人(Inviter):曾振柄


报告摘要:In this talk, I will introduce a safe reinforcement learning approach to synthesize deep neural network (DNN) controllers for nonlinear systems subject to safety constraints. The proposed approach employs an iterative scheme where a learner and averifier interact to synthesize safe DNN controllers. Thelearner trains a DNN controller via deep reinforcement learning, and the verifier certifies the learned controller through computing a maximal safe initial region and its corresponding barrier certificate, based on polynomial abstraction and bilinear matrix inequalities solving. Compared with the existing verification-in-the-loop synthesis methods, our iterative framework is a sequential synthesis scheme of controllers and barrier certificates, which can learn safe controllers with adaptive barrier certificates rather than user-defined ones. We implement the tool SRLBC and evaluate its performance over a set of benchmark examples. The experimental results demonstrate that our approach efficiently synthesizes safe DNN controllers even for a nonlinear system with dimension up to 12.

上一条:Seminar第2217讲 Drinfel'd--Sokolov 方程族的可积(半)离散

下一条:Seminar第2215讲 Value-Gradient Formulation for Optimal Control Problem and its Machine-Learning Algorithm

  版权所有 © 上海大学   沪ICP备09014157   沪公网安备31009102000049号  地址:上海市宝山区上大路99号    邮编:200444   电话查询
 技术支持:上海大学信息化工作办公室   联系我们