...
首页> 外文期刊>Transportation research >Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning
【24h】

Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning

机译:使用逆强化学习对共享空间中的行人与骑行者交互进行建模

获取原文
获取原文并翻译 | 示例
           

摘要

The objective of this study is to model the microscopic behaviour of mixed traffic (cyclistpedestrian) interactions in non-motorized shared spaces. Video data were collected at two locations of Robson Square non-motorized shared space in downtown Vancouver, British Columbia. Trajectories of cyclists and pedestrians involved in interactions were extracted using computer vision algorithms. The extracted trajectories were used to obtain several variables that describe elements of road users' behaviour including longitudinal and lateral distances, speed and speed differences, interaction angle, and cyclist acceleration and yaw rate. The road users behaviour was modeled as utility-based intelligent rational agents using the finite-state Markov Decision Process (MDP) framework with unknown reward functions. The study implemented Inverse Reinforcement Learning (IRL) using two algorithms: the Maximum Entropy (ME) algorithm, and the Feature Matching (FM) algorithm to recover/estimate the reward function weights of cyclists in two types of interactions with pedestrians: following and overtaking interactions. Reward function weights infer cyclist preferences during their interactions with pedestrians in non-motorized shared spaces, and can form the key component in developing agent based microsimulation model for road users. Furthermore, the estimated reward functions were used to estimate cyclists' optimal policy for such interactions. A simulation platform was developed using the estimated reward functions and the cyclist optimal policies to simulate cyclist trajectories for the validation dataset. Results show that the Maximum Entropy (ME) IRL algorithm outperformed the Feature Matching (FM) IRL algorithm, and generally provided reasonable results for modeling such interactions in non-motorized shared spaces, considering the high degrees of freedom in movement and the more-complex road users interactions in such facilities. This research is considered an important step toward developing a full Agent-Based Model (ABM) for road users in shared space facilities to evaluate the safety and efficiency of such facilities. (C) 2020 Elsevier Ltd. All rights reserved.
机译:这项研究的目的是模拟非机动共享空间中混合交通(骑自行车的人)互动的微观行为。视频数据是在不列颠哥伦比亚省温哥华市区罗布森广场非机动共享空间的两个位置收集的。使用计算机视觉算法提取了参与交互的骑自行车者和行人的轨迹。提取的轨迹用于获取描述道路使用者行为要素的多个变量,包括纵向和横向距离,速度和速度差,相互作用角度以及骑车人的加速度和偏航率。使用具有未知奖励功能的有限状态马尔可夫决策过程(MDP)框架,将道路使用者的行为建模为基于效用的智能理性主体。该研究使用两种算法实现了逆向强化学习(IRL):最大熵(ME)算法和特征匹配(FM)算法,用于在与行人的两种互动中恢复/估计骑车人的奖励功能权重:跟随和超车互动。奖励功能权重可以推断出骑车者在非机动共享空间中与行人互动时的偏好,并且可以形成针对道路使用者的基于代理的微观模拟模型的关键组成部分。此外,估计的奖励函数被用来估计自行车手对于这种交互的最佳策略。使用估计的奖励函数和骑单车的最佳策略开发了一个仿真平台,以为验证数据集模拟单车的轨迹。结果表明,考虑到运动的高度自由度和更复杂的条件,最大熵(IR)算法优于特征匹配(FM)IRL算法,并且通常为非机动共享空间中的此类交互建模提供合理的结果。道路使用者在此类设施中的互动。该研究被认为是为共享空间设施中的道路用户开发基于代理的完整模型(ABM)的重要一步,以评估此类设施的安全性和效率。 (C)2020 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号