首页> 外文期刊>International Journal of Business Intelligence and Data Mining >Information retrieval on documents methodology based on entropy filtering methodologies
【24h】

Information retrieval on documents methodology based on entropy filtering methodologies

机译:基于熵过滤方法的文献信息检索方法

获取原文
获取原文并翻译 | 示例
           

摘要

Information retrieval problem occurs when the target information is not available 'literally' into the set of documents. In problems in which the goal is to find 'hidden' information, it is important to develop hybrid methodologies or improve and design a new one. In this work the authors are dealing with identifying the most informative piece of data on a collection of documents, in order to obtain the best result on a posterior fuzzy clustering stage. The aim is to find similarities between the documents and a reference target, to establish relationships related to a non-literal feature. We propose to apply the well-known entropy term weighting scheme and then show a posterior different procedures to the right election of the interest data. This procedure brings the biggest amount of information within the smallest amount of data. Applying a specific selection procedure for a group of words, gives more information to differentiate and separate the documents after using the entropy weighting. This returns considerable results on the processing time and the right fuzzy clustering of the documents collection.
机译:当目标信息在字面意义上不可用时,就会出现信息检索问题。在目标是寻找“隐藏”信息的问题中,开发混合方法或改进和设计新方法非常重要。在这项工作中,作者正在处理在文档集合中识别最有用的数据,以便在后模糊聚类阶段获得最佳结果。目的是在文档和参考目标之间找到相似之处,以建立与非文字特征相关的关系。我们建议应用众所周知的熵项加权方案,然后对利息数据的正确选择显示不同的后验程序。此过程可在最少的数据量内带来最大的信息量。对一组单词应用特定的选择过程,可以在使用熵加权后提供更多信息来区分和分离文档。这将在处理时间和文档集合的正确模糊聚类上返回可观的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号