首页> 外文期刊>International Journal of Geographical Information Science >Named entity recognition goes to old regime France: geographic text analysis for early modern French corpora
【24h】

Named entity recognition goes to old regime France: geographic text analysis for early modern French corpora

机译:名为实体识别前往旧制度法国:早期现代法国语料库的地理文本分析

获取原文
获取原文并翻译 | 示例
           

摘要

Geographic text analysis (GTA) research in the digital humanities has focused on projects analyzing modern English-language corpora. These projects depend on temporally specific lexicons and gazetteers that enable place name identification and georesolution. Scholars working on the early modern period (1400-1800) lack temporally appropriate geoparsers and gazetteers and have been reliant on general purpose linked open data services like Geonames. These anachronistic resources introduce significant information retrieval and ethical challenges for early modernists. Using the geography entries of the canonical eighteenth-century Encyclopedie, we evaluate rule-based named entity recognition (NER) systems to pinpoint areas where they would benefit from adjustments for processing historical corpora. As we demonstrate, annotating nested and extended place information is one way to improve early modern GTA. Working with Enlightenment sources also motivates a critique of the landscape of digital geospatial data.
机译:数字人文科学的地理文本分析(GTA)研究专注于分析现代英语语言的项目。这些项目依赖于暂时特定的词典和公布者,使得名称识别和岩土积能够。学者在早期的现代时期工作(1400-1800)缺乏暂时适当的地质标志和公鸡,并且一直依赖于通用的衔接开放数据服务,如地缘名称。这些不间断的资源为早期现代主义者介绍了重大信息检索和道德挑战。使用规范十八世纪百科百科的地理条目,我们评估基于规则的命名实体识别(NER)系统,以确定他们将受益于处理历史上的调整的区域。正如我们所证明的那样,注释嵌套和扩展的地方信息是改善早期现代GTA的一种方式。与启蒙来源合作也激励了数字地理空间数据景观的批评。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号