首页> 外文会议>Intelligent Transportation Systems, 2005. Proceedings. 2005 IEEE >Identification of rear-end crash patterns on instrumented freeways: a data mining approach
【24h】

Identification of rear-end crash patterns on instrumented freeways: a data mining approach

机译:识别高速公路上的追尾事故模式:一种数据挖掘方法

获取原文

摘要

Data mining is the analysis of large "observational" datasets to find unsuspected relationships that might be useful to the data owner. It typically involves analysis where objectives of the mining exercise have no bearing on the data collection strategy. Freeway traffic surveillance data collected through underground loop detectors is one such "observational" database maintained for various ITS (intelligent transportation systems) applications such as travel time prediction etc. In this research data mining process is used to relate this surrogate measure of traffic conditions (data from freeway loop detectors) with occurrence of rear-end crashes on freeways. The results from this analysis are envisioned to be the first step in the development of a functional proactive traffic management system. The dataset under consideration includes information on crashes and corresponding traffic data collected from detectors neighboring the crash locations just prior to the time of the crash. The problem is setup as a classification problem for a crash being rear-end vs. not. Three types of classification tree involving different splitting criterion were attempted for variable selection. It was found that the classification tree with chi sq. test as the splitting criterion resulted in the most inclusive list of variables. The variable selection was followed by two neural network architectures, namely, the RBF (radial basis function) and MLP (multi-layer perceptron) to model the binary target variable. The two neural network models were then combined based on their output to achieve any possible improvement in the classification accuracy. It was found, however, that the classification tree model with chi sq. test as splitting criterion (with more than 65% classification accuracy) was better than any of the individual or combined neural network models (54-55% classification accuracy). Since the decision tree model also provides simple interpretable rules to classify the data in a real-time application it was recommended as the final classification model.
机译:数据挖掘是对大型“观测”数据集的分析,以发现可能对数据所有者有用的不可怀疑的关系。它通常涉及分析,其中采矿活动的目标与数据收集策略无关。通过地下环路检测器收集的高速公路交通监控数据就是为各种ITS(智能交通系统)应用(例如行程时间预测等)维护的此类“观测”数据库。在本研究中,数据挖掘过程用于关联交通状况的替代度量(来自高速公路环路检测器的数据),并在高速公路上发生追尾事故。这种分析的结果被认为是功能性主动型交通管理系统开发的第一步。所考虑的数据集包括有关碰撞的信息,以及恰好在碰撞发生之前从邻近碰撞位置的检测器收集的相应交通数据。该问题被设置为针对后端崩溃与非崩溃崩溃的分类问题。尝试了三种涉及不同分裂准则的分类树来进行变量选择。结果发现,以卡方检验作为划分标准的分类树产生了最广泛的变量列表。变量选择之后是两个神经网络体系结构,即RBF(径向基函数)和MLP(多层感知器),以对二进制目标变量进行建模。然后根据两个神经网络模型的输出进行组合,以实现分类精度的任何可能的提高。但是,发现以卡方检验为划分标准的分类树模型(分类精度超过65%)比任何单个或组合的神经网络模型(分类精度为54-55%)都要好。由于决策树模型还提供了简单的可解释规则来对实时应用程序中的数据进行分类,因此建议将其作为最终分类模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号