Clustering of Web Search Results Based on Document Segmentation

首页> 外文期刊>Computer and Information Science >Clustering of Web Search Results Based on Document Segmentation

【24h】

Clustering of Web Search Results Based on Document Segmentation

机译：基于文档细分的Web搜索结果聚类

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The process of clustering documents in a manner which produces accurate and compact clusters becomes increasingly significant mainly with the vast size of information on the web. This problem becomes even more complicated with the multi-topics nature of documents these days. In this paper, we deal with the problem of clustering documents retrieved by a search engine, where each document deals with multiple topics. Our approach is based on segmenting each document into a number of segments and then clustering segments of all documents using the Lingo algorithm. We evaluate the quality of clusters obtained by clustering full documents directly and by clustering document segments using the distance-based average intra-cluster similarity measure. Our results illustrate that average intra-cluster similarity is increased by approximately 75% as a result of clustering document segments as compared to clustering full documents retrieved by the search engine.

机译：主要通过网络上的大量信息，以产生准确而紧凑的簇的方式对文档进行簇化的过程变得越来越重要。如今，随着文档的多主题性质，这个问题变得更加复杂。在本文中，我们处理了将搜索引擎检索的文档聚类的问题，其中每个文档都涉及多个主题。我们的方法基于将每个文档分为多个段，然后使用Lingo算法将所有文档的段聚类。我们评估通过直接聚类完整文档和使用基于距离的平均聚类内相似性度量聚类文档片段而获得的聚类质量。我们的结果表明，与对搜索引擎检索到的完整文档进行聚类相比，聚类文档段可将平均聚类内相似度提高约75％。

著录项

来源
《Computer and Information Science》 |2013年第3期|共1页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Clustering of Web Search Results Based on Document Segmentation [J] . Mohammad Hasan Haggag, Amal Aboutabl, Najla Mukhtar Computer and information science . 2013,第3期

机译：基于文档细分的Web搜索结果聚类
2. Clustering of Web Search Results Based on Document Segmentation [J] . Mohammad Hasan Haggag, Amal Aboutabl, Najla Mukhtar Computer and Information Science . 2013,第3期

机译：基于文档细分的Web搜索结果聚类
3. Enhancing web search by using query-based clusters and multi-document summaries [J] . Qumsiyeh Rani, Ng Yiu-Kai Knowledge and information systems . 2016,第2期

机译：通过使用基于查询的群集和多文档摘要来增强Web搜索
4. Contextual Query based on Segmentation and Clustering of Selected Documents for Acquiring Web Documents for Supporting Knowledge Management [C] . Joao C. Prates, Sean S. M. Siqueira Americas conference on information systems;AMCIS 2011 . 2011

机译：基于选定文档的细分和聚类的上下文查询，以获取支持知识管理的Web文档
5. Clustering Web documents: A phrase-based method for grouping search engine results. [D] . Zamir, Oren Eli. 1999

机译：Web文档群集：一种基于短语的方法，用于对搜索引擎结果进行分组。
6. Enhancing web search result clustering model based on multiview multirepresentation consensus cluster ensemble (mmcc) approach [O] . Ali Sabah, Sabrina Tiun, Nor Samsiah Sani, 2021

机译：基于MultiView多重特派复断的共识群集（MMCC）方法增强基于MultiView Multimirepration的群集群集模型
7. Clustering of Web Search Results Based on Document Segmentation [O] . Amal Aboutabl, Mohammad Hasan Haggag, Najla Mukhtar 2013

机译：基于文档细分的Web搜索结果聚类
8. Web Page Clustering using Heuristic Search in the Web Graph [R] . Bekkerman, R. , Zilberstein, S. , Allan, J. 2006

机译：Web图中使用启发式搜索的网页聚类

Clustering of Web Search Results Based on Document Segmentation

摘要

著录项

相似文献

相关主题

期刊订阅