首页> 外国专利> KEYWORD EXTRACTING METHOD USING TEXT MINING

KEYWORD EXTRACTING METHOD USING TEXT MINING

机译:使用文本挖掘的关键词提取方法

摘要

A keyword extraction method is disclosed. The keyword extraction method comprises the steps of: text-mining each of a plurality of technical documents to generate a document-term matrix in which term frequency (TF) of each of a plurality of terms included in each of the technical documents is used as an element; determining a first weight of each element of the document-term matrix using inverse document frequency (IDF); determining, as a second weight, a value that the total sum of the first weights corresponding to the terms is divided by the number of documents that include the terms; and selecting a keyword based on the second weight. Therefore, by text-mining a plurality of technical documents, the technical documents may be analyzed through the structured data.
机译:公开了一种关键词提取方法。关键字提取方法包括以下步骤:对多个技术文档中的每一个进行文本挖掘以生成文档术语矩阵,其中将每个技术文档中包括的多个术语中的每个术语的术语频率(TF)用作一个元素;使用反文档频率(IDF)确定文档项矩阵的每个元素的第一权重;确定第二权重的值是将与这些术语相对应的第一权重的总和除以包括这些术语的文档的数量;并根据第二权重选择关键字。因此,通过文本挖掘多个技术文档,可以通过结构化数据来分析技术文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号