【24h】

Accelerating business analytics applications

机译:加速业务分析应用程序

获取原文

摘要

Business text analytics applications have seen rapid growth, driven by the mining of data for various decision making processes. Regular expression processing is an important component of these applications, consuming as much as 50% of their total execution time. While prior work on accelerating regular expression processing has focused on Network Intrusion Detection Systems, business analytics applications impose different requirements on regular expression processing efficiency. We present an analytical model of accelerators for regular expression processing, which includes memory bus-, I/O bus-, and network-attached accelerators with a focus on business analytics applications. Based on this model, we advocate the use of vector-style processing for regular expressions in business analytics applications, leveraging the SIMD hardware available in many modern processors. In addition, we show how SIMD hardware can be enhanced to improve regular expression processing even further. We demonstrate a realized speedup better than 1.8 for the entire range of data sizes of interest. In comparison, the alternative strategies deliver only marginal improvement for large data sizes, while performing worse than the SIMD solution for small data sizes.
机译:商业文本分析应用程序已经看到了迅速增长,由各种决策过程的数据挖掘驱动。正则表达处理是这些应用程序的重要组成部分,消耗总执行时间的50%。虽然在加速正常表达处理的事先工作的同时,在网络入侵检测系统上,但业务分析应用对正则表达处理效率施加了不同的要求。我们提出了一个用于正规表达处理的加速器的分析模型,包括内存总线,I / O总线和网络连接的加速器,专注于业务分析应用程序。基于此模型,我们主张使用矢量风格处理在业务分析应用中的正则表达式,利用许多现代处理器中提供的SIMD硬件。此外,我们展示了如何提高SIMD硬件,以进一步提高正则表达处理。我们在兴趣的整个数据尺寸范围内展示了一个比1.8更好的加速。相比之下,替代策略仅为大数据规模提供边际改善,同时执行比SIMD解决方案更糟糕的小数据尺寸。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号