Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning

首页> 外文期刊>Transportation research >Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning

【24h】

Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning

机译：使用逆强化学习对共享空间中的行人与骑行者交互进行建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The objective of this study is to model the microscopic behaviour of mixed traffic (cyclistpedestrian) interactions in non-motorized shared spaces. Video data were collected at two locations of Robson Square non-motorized shared space in downtown Vancouver, British Columbia. Trajectories of cyclists and pedestrians involved in interactions were extracted using computer vision algorithms. The extracted trajectories were used to obtain several variables that describe elements of road users' behaviour including longitudinal and lateral distances, speed and speed differences, interaction angle, and cyclist acceleration and yaw rate. The road users behaviour was modeled as utility-based intelligent rational agents using the finite-state Markov Decision Process (MDP) framework with unknown reward functions. The study implemented Inverse Reinforcement Learning (IRL) using two algorithms: the Maximum Entropy (ME) algorithm, and the Feature Matching (FM) algorithm to recover/estimate the reward function weights of cyclists in two types of interactions with pedestrians: following and overtaking interactions. Reward function weights infer cyclist preferences during their interactions with pedestrians in non-motorized shared spaces, and can form the key component in developing agent based microsimulation model for road users. Furthermore, the estimated reward functions were used to estimate cyclists' optimal policy for such interactions. A simulation platform was developed using the estimated reward functions and the cyclist optimal policies to simulate cyclist trajectories for the validation dataset. Results show that the Maximum Entropy (ME) IRL algorithm outperformed the Feature Matching (FM) IRL algorithm, and generally provided reasonable results for modeling such interactions in non-motorized shared spaces, considering the high degrees of freedom in movement and the more-complex road users interactions in such facilities. This research is considered an important step toward developing a full Agent-Based Model (ABM) for road users in shared space facilities to evaluate the safety and efficiency of such facilities. (C) 2020 Elsevier Ltd. All rights reserved.

机译：这项研究的目的是模拟非机动共享空间中混合交通（骑自行车的人）互动的微观行为。视频数据是在不列颠哥伦比亚省温哥华市区罗布森广场非机动共享空间的两个位置收集的。使用计算机视觉算法提取了参与交互的骑自行车者和行人的轨迹。提取的轨迹用于获取描述道路使用者行为要素的多个变量，包括纵向和横向距离，速度和速度差，相互作用角度以及骑车人的加速度和偏航率。使用具有未知奖励功能的有限状态马尔可夫决策过程（MDP）框架，将道路使用者的行为建模为基于效用的智能理性主体。该研究使用两种算法实现了逆向强化学习（IRL）：最大熵（ME）算法和特征匹配（FM）算法，用于在与行人的两种互动中恢复/估计骑车人的奖励功能权重：跟随和超车互动。奖励功能权重可以推断出骑车者在非机动共享空间中与行人互动时的偏好，并且可以形成针对道路使用者的基于代理的微观模拟模型的关键组成部分。此外，估计的奖励函数被用来估计自行车手对于这种交互的最佳策略。使用估计的奖励函数和骑单车的最佳策略开发了一个仿真平台，以为验证数据集模拟单车的轨迹。结果表明，考虑到运动的高度自由度和更复杂的条件，最大熵（IR）算法优于特征匹配（FM）IRL算法，并且通常为非机动共享空间中的此类交互建模提供合理的结果。道路使用者在此类设施中的互动。该研究被认为是为共享空间设施中的道路用户开发基于代理的完整模型（ABM）的重要一步，以评估此类设施的安全性和效率。（C）2020 Elsevier Ltd.保留所有权利。

著录项

来源
《Transportation research》 |2020年第4期|37-57|共21页
作者

展开▼
作者单位

Univ British Columbia Dept Civil Engn 6250 Appl Sci Lane Vancouver BC V6T 1Z4 Canada;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Shared space modeling; Overtaking behavior; Following behavior; Simulation; Cyclist and pedestrian; Reward function;

机译：共享空间建模;超车行为;跟随行为;模拟;骑自行车的人和行人;奖励功能;

相似文献

外文文献
中文文献
专利

1. Markov-game modeling of cyclist-pedestrian interactions in shared spaces: A multi-agent adversarial inverse reinforcement learning approach [J] . Alsaleh Rushdi, Sayed Tarek Transportation research . 2021,第Jula期

机译：广播空间中骑自行车者行人互动的马尔可夫 - 游戏模型
2. Learning from Longitudinal Face DemonstrationWhere Tractable Deep Modeling Meets Inverse Reinforcement Learning [J] . Duong Chi Nhan, Quach Kha Gia, Luu Khoa, International Journal of Computer Vision . 2019,第6a7期

机译：从纵向表演中学习讲话的深度建模符合逆钢筋学习
3. Modeling of passengers’ choice using intelligent agents with reinforcement learning in shared interests systems; a basic approach [J] . Vikharev S., Lyapustin M., Mironov D., Transport Problems: an International Scientific Journal: Problemy Transportu . 2019,第2期

机译：在共享利益系统中使用具有增强学习功能的智能代理为乘客的选择建模;基本方法
4. Modeling Interactions of Multimodal Road Users in Shared Spaces [C] . Fatema T. Johora, Jörg P. Müller International Conference on Intelligent Transportation Systems . 2018

机译：共享空间中的多式联运用户互动建模
5. Inferring Structural Models of Travel Behavior: An Inverse Reinforcement Learning Approach [D] . Feygin, Sidney. 2018

机译：推断出行行为的结构模型：反强化学习方法
6. A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning [O] . Zhewei Zhang, Zhenbo Cheng, Zhongqiao Lin, 2018

机译：强化学习过程中眶额皮质和任务空间获取的神经网络模型
7. Connections Between Relational Event Model and Inverse Reinforcement Learning for Characterizing Group Interaction Sequences [O] . Congyu Wu 2021

机译：结合事件模型与逆钢筋学习的关系，用于表征组交互序列

Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅