首页> 外文会议>Annual ACM symposium on applied computing;ACM symposium on applied computing;SAC 2010 >Traveling among Clusters: a Way to Reconsider the Benefits of the Cluster Hypothesis
【24h】

Traveling among Clusters: a Way to Reconsider the Benefits of the Cluster Hypothesis

机译:在集群之间旅行:重新考虑集群假说的好处的一种方法

获取原文

摘要

Relying on the Cluster Hypothesis which states that relevant documents tend to be more similar one to each other than to non-relevant documents, most of information retrieval systems organizing search results as a set of clusters seek to gather all relevant documents in the same cluster. We propose here to reconsider the benefits of the entailed concentration of the relevant information. Contrary to what is commonly admitted, we believe that systems which aim to distribute the relevant documents in different clusters, since being more likely to highlight different aspects of the subject, may be at least as useful for the user as systems gathering all relevant documents in a single group. Since existing evaluation measures tend to greatly favor the latter systems, we first investigate ways to more fairly assess the ability to reach the relevant information from the list of cluster descriptions. At last, we show that systems distributing the relevant information in different clusters may actually provide a better information access than classical systems.
机译:依靠集群假说,集群假说指出相关文档之间的相似性比不相关文档更为相似,大多数将搜索结果组织为一组聚类的信息检索系统都试图将所有相关文档收集在同一聚类中。我们在这里建议重新考虑将相关信息集中在一起的好处。与通常所接受的相反,我们认为旨在将相关文档分布在不同类别中的系统,因为它更有可能突出显示主题的不同方面,因此对用户至少与在系统中收集所有相关文档的系统一样有用。一个小组。由于现有的评估方法倾向于极大地支持后一种系统,因此我们首先研究更公平地评估从聚类描述列表中获取相关信息的能力的方法。最后,我们证明了将相关信息分布在不同集群中的系统实际上可能比经典系统提供更好的信息访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号