首页> 外文学位 >A machine-aided approach to intelligent index generation: Using natural language processing and latent semantic analysis to determine the contexts and relationships among words in a corpus.
【24h】

A machine-aided approach to intelligent index generation: Using natural language processing and latent semantic analysis to determine the contexts and relationships among words in a corpus.

机译:一种机器辅助的智能索引生成方法:使用自然语言处理和潜在语义分析来确定语料库中单词之间的上下文和关系。

获取原文
获取原文并翻译 | 示例

摘要

Back-of-the-book indexing is the process of generating a list of relevant terms, sub-terms and cross-references from a corpus and providing the user with corresponding page references.; Several cognitive tasks are necessary to produce a good index, and are performed primarily by the human indexer. Indexing has become somewhat automated through computer applications, which at best generate a concordance, and exist to reduce the mundane portions of the process. However, none of these tools determines which terms to index, nor do they capture context-sensitive information about terms and their relationships. Human indexers perform these time-consuming tasks.; The challenge is to develop software that bridges the gap between computerized concordances and manual indexing. The prototype application described herein is unique in its ability to incorporate the intelligent portions of the process. Because of this, it provides a robust draft index that a human indexer can refine in a fraction of the time.
机译:书后索引是从语料库中生成相关术语,子术语和交叉引用列表并为用户提供相应页面引用的过程。几个认知任务对于产生良好的索引是必要的,并且主要由人类索引器执行。索引已通过计算机应用程序实现了某种程度的自动化,充其量只能产生一致性,并且可以减少过程中的平凡部分。但是,这些工具都不能确定要索引的术语,也不能捕获有关术语及其关系的上下文相关信息。人类索引器执行这些耗时的任务。挑战在于开发弥合计算机一致性和手动索引之间的差距的软件。本文描述的原型应用程序在合并过程的智能部分方面具有独特的能力。因此,它提供了一个健壮的草稿索引,人类索引员可以在短时间内完善该索引。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号