首页> 外国专利> Retrieving handwritten documents using multiple document recognizers and techniques allowing both typed and handwritten queries

Retrieving handwritten documents using multiple document recognizers and techniques allowing both typed and handwritten queries

机译:使用多种文档识别器和技术检索手写文档,同时允许键入和手写查询

摘要

The techniques in the present invention allow both text and handwritten queries, and the queries can be single-word or multiword. Generally, each handwritten word in a handwritten document is converted to a document stack of words, where each document stack contains a list of text words and a word score of some type for each text word in the list. The query is also converted to one or more stacks of words. A measure is determined from each query and document stack. Documents that meet search criteria in the query are then selected based on the query and the values of the measures. The present invention also performs multiple recognitions, with multiple recognizers, on a handwritten document to create multiple recognized transcriptions of the document. The multiple transcriptions are used for document retrieval. In another embodiment, a single transcription is created from the multiple transcriptions, and the single transcription is used for document retrieval.
机译:本发明中的技术允许文本查询和手写查询,并且查询可以是单字或多字。通常,手写文档中的每个手写单词都将转换为单词文档堆栈,其中每个文档堆栈都包含一个文本单词列表和该列表中每个文本单词的某种类型的单词分数。查询也将转换为一个或多个单词堆栈。根据每个查询和文档堆栈确定度量。然后根据查询和度量值选择满足查询中搜索条件的文档。本发明还在手写文档上执行具有多个识别器的多个识别,以创建该文档的多个识别的转录。多个转录用于文档检索。在另一个实施例中,从多个转录创建单个转录,并且将该单个转录用于文档检索。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号