极速赛车官网

当前位置: 极速赛车官网 > 极速赛车官网 > 学术活动 > 正文

A Two-fold Randomization Framework for Impulse Control Problems

发布日期:2025-12-03点击数:

报告人:董玉超 副研究员(同济大学)

时间:2025年12月08日 09:00-

地址:极速赛车官网 LD718


摘要:We propose and analyze a randomization scheme for a general class of impulse control problems. The solution to this randomized problem is characterized as the fixed point of a compound operator which consists of a regularized nonlocal operator and a regularized stopping operator. This approach allows us to derive a semi-linear Hamilton-Jacobi-Bellman (HJB) equation. Through an equivalent randomization scheme with a Poisson compound measure, we establish a verification theorem that implies the uniqueness of the solution. Via an iterative approach, we prove the existence of the solution. The existence--and--uniqueness result ensures the randomized problem is well-defined. We then demonstrate that our randomized impulse control problem converges to its classical counterpart as the randomization parameter \(\blambda\) vanishes. This convergence, combined with the value function's \(\mathcal{C}^{2,\alpha}_{loc}\) regularity, confirms our framework provides a robust approximation and a foundation for developing learning algorithms. Under this framework, we propose an offline reinforcement learning (RL) algorithm. Its policy improvement step is naturally derived from the iterative approach from the existence proof, which enjoys a geometric convergence rate. We implement a model-free version of the algorithm and numerically demonstrate its effectiveness using a widely-studied example. The results show that our RL algorithm can learn the randomized solution, which accurately approximates its classical counterpart. A sensitivity analysis with respect to the volatility parameter \(\sigma\) in the state process effectively demonstrates the exploration--exploitation tradeoff.


邀请人:张志


欢迎广大师生积极参与!


关于我们
极速赛车官网 的前身是始建于1929年的重庆大学理极速赛车官网 和1937年建立的重庆大学商极速赛车官网 ,理极速赛车官网 是重庆大学最早设立的三个极速赛车官网 之一,首任院长为数学家何鲁先生。