首页> 外国专利> syntax-based method of providing object classification in compressed video by use of neural network which is learned by cooperation with an external commercial classifier

syntax-based method of providing object classification in compressed video by use of neural network which is learned by cooperation with an external commercial classifier

机译：通过使用神经网络与外部商业分类器协作学习的基于语法的压缩视频对象分类方法

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to a technique for effectively classifying objects from a normal compressed video such as an H.264 AVC or H.265 HEVC video. More specifically, the present invention relates to a technique for quickly acquiring an object classification result by using syntax information (for example, motion vectors and coding types) obtained by parsing compressed image data to extract an area, that is, a moving object area, in which a meaningful motion is present in a video and then, inputting moving attribute information of the moving object area to a neural network, without recognizing and classifying objects from a compressed video generated by, for example, a CCTV camera through complicated image processing as in the conventional art. In particular, the present invention relates to a technique capable of increasing the reliability of the object classification result by enabling the neural network to learn a large amount of data acquired from various photography environments at high speeds through software processing without human intervention by acquiring a combination of the moving attribute information and the object classification information from a plurality of moving object areas in a manner of transmitting an image of the plurality of moving object areas extracted from the compressed video to an external commercial classifier (for example, an Azure cloud service) and receiving the object classification information, and then, by making the neural network learn the same by using the combination.

机译：本发明涉及一种用于从诸如H.264 AVC或H.265 HEVC视频的普通压缩视频中有效地对对象进行分类的技术。更具体地，本发明涉及一种通过使用语法信息（例如，运动矢量和编码类型）来快速获取对象分类结果的技术，该语法信息是通过对压缩图像数据进行解析以提取区域，即运动对象区域，其中，在视频中存在有意义的运动，然后将运动对象区域的运动属性信息输入到神经网络，而无需从例如CCTV摄像机通过复杂的图像处理生成的压缩视频中识别和分类对象，在常规技术中。特别地，本发明涉及一种能够通过使神经网络通过软件处理，无需人工干预，通过获取组合来使神经网络高速学习从各种摄影环境中获取的大量数据的技术，从而能够提高对象分类结果的可靠性。从多个运动对象区域中获取运动属性信息和对象分类信息的方式，是将从压缩视频中提取的多个运动对象区域的图像传输到外部商业分类器（例如，Azure云服务）接收对象分类信息，然后通过组合使神经网络学习相同的信息。

著录项

公开/公告号KR20200067682A

专利类型
公开/公告日2020-06-12

原文格式PDF
申请/专利权人 INNODEP CO. LTD.;
展开▼

申请/专利号KR20180154787
发明设计人 KIM TAE WOO;LEE SUNG JIN;
展开▼

申请日2018-12-04
分类号G06K9/62;G06N3/08;H04N19/20;H04N19/70;
国家 KR
入库时间 2022-08-21 11:06:49

相似文献

专利
外文文献
中文文献