首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >An FPGA-Based Ultra-High-Speed Object Detection Algorithm with Multi-Frame Information Fusion
【2h】

An FPGA-Based Ultra-High-Speed Object Detection Algorithm with Multi-Frame Information Fusion

机译:基于FPGA的多帧信息融合超高速目标检测算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

An ultra-high-speed algorithm based on Histogram of Oriented Gradient (HOG) and Support Vector Machine (SVM) for hardware implementation at 10,000 frames per second (FPS) under complex backgrounds is proposed for object detection. The algorithm is implemented on the field-programmable gate array (FPGA) in the high-speed-vision platform, in which 64 pixels are input per clock cycle. The high pixel parallelism of the vision platform limits its performance, as it is difficult to reduce the strides between detection windows below 16 pixels, thus introduce non-negligible deviation of object detection. In addition, limited by the transmission bandwidth, only one frame in every four frames can be transmitted to PC for post-processing, that is, 75% image information is wasted. To overcome the mentioned problem, a multi-frame information fusion model is proposed in this paper. Image data and synchronization signals are first regenerated according to image frame numbers. The maximum HOG feature value and corresponding coordinates of each frame are stored in the bottom of the image with that of adjacent frames’. The compensated ones will be obtained through information fusion with the confidence of continuous frames. Several experiments are conducted to demonstrate the performance of the proposed algorithm. As the evaluation result shows, the deviation is reduced with our proposed method compared with the existing one.
机译:提出了一种基于定向梯度直方图(HOG)和支持向量机(SVM)的超高速算法,用于在复杂背景下以10,000帧/秒(FPS)的硬件实现。该算法在高速视觉平台的现场可编程门阵列(FPGA)上实现,每个时钟周期输入64个像素。视觉平台的高像素并行度限制了它的性能,因为很难将检测窗口之间的步幅减小到16个像素以下,从而引入了不可忽略的物体检测偏差。另外,受传输带宽的限制,每四帧中只有一帧可以发送到PC进行后期处理,即浪费了75%的图像信息。为了克服上述问题,提出了一种多帧信息融合模型。首先根据图像帧号重新生成图像数据和同步信号。每个帧的最大HOG特征值和对应的坐标与相邻帧的HOG特征值和对应的坐标一起存储在图像的底部。补偿后的帧将通过信息融合获得连续帧的置信度。进行了一些实验,以证明该算法的性能。如评估结果所示,与现有方法相比,我们的方法减少了偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号