An Efficient Association Rule Based Clustering of XML Documents

A. Muralidhar; V. Pattabiraman

首页> 外文期刊>Procedia Computer Science >An Efficient Association Rule Based Clustering of XML Documents

【24h】

An Efficient Association Rule Based Clustering of XML Documents

机译：基于有效关联规则的XML文档聚类

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mining the web data is one of the emerging researches in data mining. The HTML can be used for maintaining the web data but it is hard to achieve the accurate web mining results from HTML documents. The XML documents make more convenient for finding the properties in web mining. Association rule based mining discovers the temporal associations among XML documents. But this kind of data mining is not sufficient to retrieve the properties of every XML document. Finding the properties for set of similar documents is better idea rather than to find the property of a single document. Hence, the key contribution of the work is to find the meaningful clustered based associations by association rule based clustering. Therefore, this paper proposes a hybrid approach which discovers the frequent XML documents by association rule mining and then find the clustering of XML documents by classical k-means algorithm. The proposed approach was tested with real data of Wikipedia. The comparative study and result analysis are discussed in the paper for knowing the importance of the proposed work.

机译：挖掘Web数据是数据挖掘中的新兴研究之一。 HTML可以用于维护Web数据，但是很难从HTML文档中获得准确的Web挖掘结果。 XML文档使在Web挖掘中查找属性更加方便。基于关联规则的挖掘发现XML文档之间的时间关联。但是，这种数据挖掘不足以检索每个XML文档的属性。找到一组相似文档的属性比找到单个文档的属性更好。因此，这项工作的关键贡献是通过基于关联规则的聚类找到有意义的基于聚类的关联。因此，本文提出了一种混合方法，该方法通过关联规则挖掘发现频繁的XML文档，然后通过经典的k-means算法找到XML文档的聚类。 Wikipedia的真实数据对提出的方法进行了测试。本文讨论了比较研究和结果分析，以了解拟议工作的重要性。

著录项

来源
《Procedia Computer Science》 |2015年第1期|共7页
作者
A. Muralidhar; V. Pattabiraman;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Chi-Square Test for Anomaly Detection in XML Documents Using Negative Association Rules [J] . K. Premalatha, A.M. Natarajan Computer and Information Science . 2009,第1期

机译：使用负关联规则对XML文档中的异常进行检测的卡方检验
2. HCMX: AN EFFICIENT HYBRID CLUSTERING APPROACH FOR MULTI-VERSION XML DOCUMENTS [J] . VIJAY SONAWANE, D.RAJESWARA.RAO Journal of Theoretical and Applied Information Technology . 2015,第1期

机译：HCMX：一种用于多版本XML文档的高效混合集群方法
3. Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries [J] . Hongqiang Wang, Jianzhong Li, Hongzhi Wang World Wide Web . 2008,第1期

机译：XML文档的群集链路径索引：有效处理分支查询
4. An Efficient Association Rule Based Clustering of XML Documents [C] . A Muralidhar, V Pattabiraman International Symposium on Big Data, Cloud and Computing Challenges . 2015

机译：基于高效的关联规则基于XML文档的聚类
5. XML2REL: An efficient system for storing and querying XML documents using relational databases [D] . Atay, Mustafa 2006

机译：XML2REL：使用关系数据库存储和查询XML文档的有效系统
6. Clinical map document based on XML (cMDX): document architecture with mapping feature for reporting and analysing prostate cancer in radical prostatectomy specimens [O] . Okyaz Eminaga, Reemt Hinkelammert, Axel Semjonow, 2010

机译：基于XML（cMDX）的临床地图文档：具有映射功能的文档体系结构用于报告和分析前列腺癌根治术标本中的前列腺癌
7. An Efficient Association Rule Based Clustering of XML Documents [O] . Muralidhar A., Pattabiraman V. 2015

机译：基于有效关联规则的XML文档聚类

An Efficient Association Rule Based Clustering of XML Documents

摘要

著录项

相似文献

相关主题

期刊订阅