上海大学学报(自然科学版) ›› 2023, Vol. 29 ›› Issue (4): 651-665.doi: 10.12066/j.issn.1007-2861.2519

• • 上一篇    下一篇

深度确定性策略梯度算法耦合模型驱动的行人过街仿真 

宋涛1,2 , 王彦林1 , 魏昕恺1 , 韦艳芳3    

  1. 1. 湖州师范学院 理学院, 浙江 湖州 313000; 2. 湖州市数据建模与分析重点实验室, 浙江 湖州 313000; 3. 玉林师范学院 物理与电信工程学院, 广西 玉林 537000
  • 收稿日期:2023-05-16 出版日期:2023-08-30 发布日期:2023-09-01
  • 通讯作者: 宋 涛 (1980—), 男, 博士, 研究方向为交通科学和交通大数据分析.
  • 基金资助:
    国家自然科学基金资助项目 (12271158); 浙江省教育厅科研资助项目 (Y202248528); 湖州市科技计划资 助项目 (2023YZ28); 国家级大学生创新创业训练计划资助项目 (202210347046) 

Pedestrian crossing simulation driven by a coupled model of deepdeterministic policy gradient algorithm 

SONG Tao 1,2 , WANG Yanlin 1 , WEI Xinkai 1 , WEI Yanfang   

  1. 1. School of Science, Huzhou University, Huzhou 313000, China; 2. Huzhou Key Laboratory of Data Modeling and Analysis, Huzhou 313000, China; 3. School of Physics and Telecommunication Engineering, Yulin Normal University, Yulin 537000, China
  • Received:2023-05-16 Online:2023-08-30 Published:2023-09-01

摘要:

行人仿真在公共安全研究中扮演着重要的角色, 但如何增强仿真环境中智能体的真实性一直是一个难题. 本工作提出了一种深度确定性策略梯度-行人二维优化速度模型耦合驱动算法, 并对无信号交叉口行人过街行为进行仿真. 通过构建考虑无速度差项和有速度差项 2 种行人二维优化速度模型的策略探索方案, 揭示出带有速度差项行人二维优化速度模型的耦合驱动算法倾向于灵活地选择相对安全的动作, 从而使行人选择动作的策略也更优, 达到了完全避免行人碰撞、确保行人安全的功能. 

关键词: 深度确定性策略梯度, 二维优化速度模型, 碰撞, 无信号交叉口, 行人仿真

Abstract:

The simulation of pedestrian flow plays an important role in public safety research. However, the enhancement of agent authenticity in simulation environments remains a challenge. Herein, we propose simulating pedestrian crossing behaviour at unsignalized intersections using a deep deterministic policy gradient algorithm to drive a two-dimensional optimal velocity pedestrian model. We constructed a strategy exploration scheme that considered two types of two-dimensional optimal velocity pedestrian models: non-velocity and velocity difference terms. The analysis revealed that the model considering the velocity difference term tended to flexibly select relatively safe actions, thus its action-selection strategy was considered optimal. Furthermore, this model completely avoided pedestrian collisions and ensured pedestrian safety. 

Key words: deep deterministic policy gradient, two-dimensional optimal velocity model; collision, unsigned intersections, pedestrian simulation 

中图分类号: