Journal of Emerging Technologies in Web Intelligence

Automatic Generation of Human-like Route Descriptions: A Corpus-driven Approach

Abstract

Most Web applications combine different services, features, and content in order to enable the creation of new features and services. Such systems are called mashups. Among the most popular kinds of mashup are location mashups, which use geographic data to provide functionality to users. RotaCerta is a location system that uses Google Maps and performs Natural Language Generation to provide textual descriptions of routes between two different locations. The great advantage of RotaCerta is its use of points of interest (POIs) to describe routes. POIs help the user understand and assimilate the route. However, RotaCerta suffers from a severe limitation: its POI dataset must be updated manually. Such work is exhausting and costly, and it greatly limits the system's use. Another point to highlight is the poor linguistic variability of the texts it provides. In this work, we propose a mechanism that feeds POIs into the system automatically and a corpus-driven approach that enhances the linguistic variability of location mashups such as RotaCerta. We adopt both manual and automatic generation of new textual templates. To assess the quality of the route descriptions, we use TF-IDF and cosine distance to calculate the similarity between route descriptions written by human volunteers and descriptions generated by the proposed approach. Route generation examples were produced for three different Brazilian cities. We also show that the text generated from the new template base is more similar to the text people use when describing routes than the text produced by Google Maps.
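The evaluation step named in the abstract, comparing human-written and generated route descriptions with TF-IDF and a cosine measure, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the tokenizer, the smoothed IDF formula, the function names, and the three sample sentences are all hypothetical, since the abstract does not specify them. Cosine distance is taken here as one minus the cosine similarity.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep runs of ASCII letters (an assumed, simplistic tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        # Smoothed IDF (an assumption) so that terms shared by every
        # document still receive a nonzero weight.
        vectors.append({
            term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical route descriptions, invented for illustration only.
human = "Go straight past the bakery, then turn left at the church."
generated = "Go straight, pass the bakery and turn left at the church."
baseline = "Head north on Main Street for 300 m, then turn left."

vec_human, vec_generated, vec_baseline = tfidf_vectors([human, generated, baseline])
sim_generated = cosine_similarity(vec_human, vec_generated)
sim_baseline = cosine_similarity(vec_human, vec_baseline)

print(f"generated vs human: similarity={sim_generated:.3f}, distance={1 - sim_generated:.3f}")
print(f"baseline vs human:  similarity={sim_baseline:.3f}, distance={1 - sim_baseline:.3f}")
```

In this toy setup, the POI-based description shares more vocabulary with the human text than the street-name baseline does, so it scores a higher similarity. A real evaluation would presumably fit the IDF weights on the whole corpus of volunteer descriptions rather than on three sentences.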
