首页> 外国专利> Indexing for regular expressions in text-centric applications

Indexing for regular expressions in text-centric applications

机译:在以文本为中心的应用程序中为正则表达式建立索引

摘要

A method, system, and article are provided for evaluating regular expressions over large data collections. A general purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify a set of documents in the collection of documents over which the regular expression can be evaluated. Based upon the query execution, the identified set of documents is returned for evaluation by the regular expression responsive to execution of the query over the index.
机译:提供了一种用于评估大型数据集合上的正则表达式的方法,系统和文章。建立通用索引可在字符级别处理复杂的正则表达式。字符,字符类和关联的元数据被标识并存储在文档集合的索引中。给定正则表达式,将基于索引的内容生成查询。该查询在索引上执行,以标识文档集合中可对其评估正则表达式的一组文档。基于查询的执行,响应于对索引的查询的执行,由正则表达式返回所标识的文档集以进行评估。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号