Published in: 《计算机科学》 (Computer Science)

General Extraction Engine Framework: A Study of a New Web Information Extraction Method

Abstract

The sheer scale of online video collections makes it easy for users to share information, but it also poses a major challenge for managing them, in particular for online monitoring by regulatory authorities. For reasons of efficiency, online video monitoring relies mainly on the descriptive information attached to each video, so a critical requirement is to identify that key information accurately and adaptively; this is also the first step for video analysis and video search. In this paper, we focus on the problem of extracting video description information from different websites. Specifically, we propose a new Web information extraction method, the general extraction engine framework. The framework comprises a formal description of the extraction problem for video description information and a user-aware logical model of video websites, and we implement a customizable extraction engine based on it. The framework has been applied in a real-world video monitoring project of a national ministry, with promising results. Experimental results show that the proposed approach extracts video information effectively and significantly outperforms the baseline methods in scalability, generality, and extraction accuracy.
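The abstract does not spell out the description model or the engine's interface, so the following is only a minimal sketch of the general idea: a single generic engine driven by a per-site description. Everything in it is an assumption made for illustration, including the `SiteDescription` class, the `extract` function, the use of regular expressions as the description language, and the two invented sites and their markup; none of it is taken from the paper.

```python
import re
from dataclasses import dataclass
from html import unescape


@dataclass
class SiteDescription:
    """Hypothetical per-site description: field name -> regex with one capture group."""
    site: str
    field_patterns: dict


def extract(description: SiteDescription, page_html: str) -> dict:
    """Generic engine: apply every field pattern in the description to one page."""
    record = {}
    for field, pattern in description.field_patterns.items():
        match = re.search(pattern, page_html, flags=re.DOTALL | re.IGNORECASE)
        # A field that is not found is reported as None instead of raising.
        record[field] = unescape(match.group(1)).strip() if match else None
    return record


# Two invented sites with different markup, handled by the same engine.
SITE_A = SiteDescription(
    site="site-a.example",
    field_patterns={
        "title": r"<h1[^>]*>(.*?)</h1>",
        "uploader": r'<span class="author">(.*?)</span>',
    },
)
SITE_B = SiteDescription(
    site="site-b.example",
    field_patterns={
        "title": r'<meta name="title" content="(.*?)"',
        "uploader": r'<a class="user"[^>]*>(.*?)</a>',
    },
)

page_a = '<h1 id="t"> Cat &amp; Dog </h1><span class="author">alice</span>'
page_b = '<meta name="title" content="Lecture 1"><a class="user" href="/u/bob">bob</a>'

result_a = extract(SITE_A, page_a)
result_b = extract(SITE_B, page_b)
```

Under these assumptions, supporting a new website means writing a new description rather than new extraction code, which is one plausible reading of the "customizable" and "general" properties the abstract claims.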
