首页> 外国专利> METHOD AND APPARATUS FOR EXTRACTING POI NAME, DEVICE, AND COMPUTER STORAGE MEDIUM

METHOD AND APPARATUS FOR EXTRACTING POI NAME, DEVICE, AND COMPUTER STORAGE MEDIUM

机译:提取POI名称,设备和计算机存储介质的方法和装置

摘要

The present application discloses a method and apparatus for extracting the name of a POI, a device and a computer storage medium, and relates to the field of big data. An implementation includes: acquiring two or more text fragments identified from image data of the POI; constructing two or more candidate names using the text fragments; and ranking the candidate names using a pre-trained name ranking model, and determining the name of the POI according to the result of the ranking; wherein the name ranking model determines the probability of each candidate name as the name of the POI using at least one of a search web page feature, a document statistical feature and a semantic feature extracted from each candidate name, and ranks the candidate names according to the probabilities. With the present application, the name of the POI is automatically extracted with high accuracy. Compared with the manual review and annotation way in the prior art, a human cost is reduced.
机译:本申请公开了一种用于提取POI,设备和计算机存储介质的名称的方法和装置,并涉及大数据的领域。实现包括:获取从POI的图像数据中标识的两个或多个文本片段;使用文本碎片构建两个或多个候选名称;并使用预先培训的名称排名模型排列候选名称,并根据排名的结果确定POI的名称;其中,名称排名模型将每个候选名称的概率确定为使用来自每个候选名称中提取的文档统计特征,文档统计特征和从每个候选名称中提取的语义特征中的至少一个的POI的名称,并根据候选名称排列候选名称概率。通过本申请,POI的名称以高精度自动提取。与现有技术中的手工评论和注释方式相比,人类成本降低。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号