首页> 外文会议>International Conference on Field Programmable Logic and Applications >Annotation-based finite-state transducers on reconfigurable devices
【24h】

Annotation-based finite-state transducers on reconfigurable devices

机译:可重新配置设备上的注释的有限状态传感器

获取原文

摘要

With the ever growing amount of unstructured data, high-speed content analysis becomes ever more important. Enabling efficient search functions to help locate specific and relevant information hidden in this big data is a crucial task of today's enterprise systems, and can lead to valuable insights. A key component of content analysis systems are text parsers, which transform unstructured text data into structured information. Cascaded grammars offer a popular and powerful representation of text parsers by enabling the definition of more complex patterns in terms of simpler ones in a hierarchical fashion. This work presents a compilation framework to generate an optimized FPGA pipeline from a cascaded grammar description. We also describe the system integration and the way FPGA-based accelerators can be used as part of larger analysis tasks within Unstructured Information Management Application (UIMA) pipelines. We compare the performance of the hardware-accelerated system and a commercial software implementation using real-life UIMA pipelines from the healthcare domain. We show that the FPGA-accelerated system processes the parsing stage of a UIMA pipeline up to 31 times faster than the software implementation running on a high-end server, which results in an acceleration of up to 5 times for the complete pipeline.
机译:随着不断增长的非结构化数据,高速内容分析变得更加重要。启用有效的搜索功能,以帮助找到隐藏在此大数据中隐藏的特定信息和相关信息是当今企业系统的一个重要任务,并导致有价值的见解。内容分析系统的一个关键组件是文本解析器,将非结构化文本数据转换为结构化信息。级联语法通过在分层时尚的简单方面使更复杂的模式进行更复杂的模式来提供文本解析器的流行和强大的表示。这项工作介绍了一个编译框架,可以从级联语法描述生成优化的FPGA流水线。我们还描述了系统集成,基于FPGA的加速器的方式可用作非结构化信息管理应用程序(UIMA)管道内的较大分析任务的一部分。我们使用来自医疗领域的现实生活UIMA管道的硬件加速系统和商业软件实现的性能进行比较。我们表明FPGA加速系统将UIMA流量的解析阶段加工到高端服务器上运行的软件实现快31倍,这导致完整流水线的加速度最多5倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号