Improved clustering technique using metadata for text mining

机译：使用元数据进行文本挖掘的改进聚类技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In many text mining applications, information from Document is present in the form of Text along with Side Information or Metadata. Examples of this side information include links to other web pages, title of the document, author name or date of Publication which are present in the text document. Such metadata may possess a lot of information for the clustering purposes. But this Side information may be sometimes noisy. Using such Side Information for producing clusters without filtering it, can result to bad quality of Clusters. So we use an efficient Feature Selection method to perform the mining process to select that Side Information which is useful for Clustering so as to maximize the advantages from using it. The proposed technique makes use of the process of Two-mode clustering which is a data mining technique that allows producing groups by Clustering both Text and Side Information.

机译：在许多文本挖掘应用程序中，来自文档的信息以文本以及辅助信息或元数据的形式出现。此辅助信息的示例包括文本文档中存在的其他网页链接，文档标题，作者姓名或出版日期。这样的元数据可能拥有大量信息以用于聚类目的。但是此补充信息有时可能很嘈杂。使用此类辅助信息来生成群集而不对其进行过滤可能会导致群集质量下降。因此，我们使用一种有效的“特征选择”方法来执行挖掘过程，以选择对聚类有用的“边信息”，从而最大程度地利用它。所提出的技术利用了双模式聚类的过程，该过程是一种数据挖掘技术，它允许通过聚类文本和边信息来产生组。

著录项

来源
《International Conference on Communication and Electronics Systems》|2016年|1-5|共5页
会议地点
作者
Ramya Elizabeth Thomas; Shamsuddin S. Khan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Indexes; Algorithm design and analysis; Partitioning algorithms; Noise measurement; Spatial databases; Data mining;

机译：聚类算法;索引;算法设计与分析;分区算法;噪声测量;空间数据库;数据挖掘;

相似文献

外文文献
中文文献
专利

1. Mining Text Data using different Text Clustering Techniques [J] . Ratna S. Patil, Prof. B. S. Chordia International Journal of Computer Trends and Technology . 2017,第2期

机译：使用不同的文本聚类技术挖掘文本数据
2. Significant Term List Based Metadata Conceptual Mining Model for Effective Text Clustering [J] . Koteeswaran S., J. Janet and E. Kannan Journal of computer sciences . 2012,第10期

机译：基于有效术语列表的元数据概念挖掘模型，用于有效的文本聚类
3. Significant Term List Based Metadata Conceptual Mining Model for Effective Text Clustering | Science Publications [J] . E. Kannan, J. Janet, S. Koteeswaran Journal of computer sciences . 2012,第10期

机译：有效术语聚类的基于重要术语列表的元数据概念挖掘模型科学出版物
4. Improved clustering technique using metadata for text mining [C] . Ramya Elizabeth Thomas, Shamsuddin S. Khan International Conference on Communication and Electronics Systems . 2016

机译：使用元数据进行文本挖掘的改进聚类技术
5. Automated generation of metadata for mining image and text data. [D] . Al-Shameri, Faleh Jassem. 2006

机译：自动生成用于挖掘图像和文本数据的元数据。
6. Combining QSAR Modeling and Text-Mining Techniques to Link Chemical Structures and Carcinogenic Modes of Action [O] . George Papamokos, Ilona Silins 2016

机译：结合QSAR建模和文本挖掘技术来链接化学结构和致癌作用模式
7. Significant Term List Based Metadata Conceptual Mining Model for Effective Text Clustering [O] . J. Janet, E. Kannan 2015

机译：基于重要术语表的元数据概念挖掘模型的有效文本聚类
8. Design of Surface Mining Systems in the Eastern Kentucky Coal Fields. Part One. Research and Demonstration of Improved Surface Mining Techniques. Volume III. [R] . 1975

机译：肯塔基州东部煤田露天采矿系统设计。第一部分。改进露天采矿技术的研究与论证。第三卷。

Improved clustering technique using metadata for text mining

摘要

著录项

相似文献

相关主题

期刊订阅