Data Preprocessing in Web Text Mining


Abstract

The abundance of information on the WWW and people's need for high-quality information have accelerated the development of efficient and effective search engines. Web text mining is one of the key techniques for search engines, but Web data is highly complex, which makes Web text mining more difficult. To obtain good mining results, Web pages must be preprocessed before any text mining starts. Given the set of pages collected by a search engine's robot, we discuss the essential steps for representing pages as vectors, such as term selection and weight representation. The purpose is to prepare for the subsequent Web text mining tasks.
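The abstract names term selection and term weighting as the steps that turn pages into vectors, but it does not say which methods the paper uses. As an illustration only, here is a minimal Python sketch that assumes document-frequency thresholding for term selection and TF-IDF for weighting. The function names, the `min_df` parameter, and the sample pages are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vectors(pages, min_df=2):
    """Return (vocabulary, vectors) for a list of page texts.

    Term selection (assumed): keep terms found in at least `min_df` pages.
    Weighting (assumed): term frequency * log(N / document frequency).
    """
    tokenized = [tokenize(page) for page in pages]

    # Document frequency: number of pages in which each term occurs.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    vocab = sorted(term for term, count in df.items() if count >= min_df)
    n_pages = len(pages)

    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)  # missing terms count as 0
        vectors.append([tf[t] * math.log(n_pages / df[t]) for t in vocab])
    return vocab, vectors


# Hypothetical sample pages, already stripped of markup.
pages = [
    "web text mining for search engines",
    "search engines index web pages",
    "text mining needs preprocessing",
]
vocab, vectors = build_vectors(pages)
for vector in vectors:
    print([round(w, 3) for w in vector])
```

Each page becomes one row whose columns follow the order of `vocab`. Real Web pages would first need HTML removal, stop-word filtering, and possibly stemming, which this sketch leaves out.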

Bibliographic details

Source: International Conference on Education Technology and Computer, 2011, pp. 643-647 (5 pages)
Authors: Jiang Yongbo; Zhang Ruili
Original format: PDF
Chinese Library Classification: computerized instruction
Keywords: data preprocessing; Web text mining; search engine

Similar literature

1. Data preprocessing evaluation for web log mining: reconstruction of activities of a web visitor [J]. Michal Munk, Jozef Kapusta, Peter Švec. Procedia Computer Science, 2010, no. 1.
2. Tools and Databases of the KOMICS Web Portal for Preprocessing, Mining, and Dissemination of Metabolomics Data [J]. Nozomu Sakurai, Takeshi Ara, Mitsuo Enomoto. BioMed Research International, 2014, no. 14.
3. Tools and Databases of the KOMICS Web Portal for Preprocessing, Mining, and Dissemination of Metabolomics Data [J]. Nozomu Sakurai, Takeshi Ara, Mitsuo Enomoto. BioMed Research International, 2014, no. 3.
4. Data Preprocessing in Web Text Mining [C]. Jiang Yongbo. International Conference on Advanced Computer Theory and Engineering, 2012.
5. Rediscovering Social Science and Business Studies Using Web Data and the Text Mining Approach [D]. Xue, Yuan. 2016.
6. Tools and Databases of the KOMICS Web Portal for Preprocessing Mining and Dissemination of Metabolomics Data [O]. Nozomu Sakurai, Takeshi Ara, Mitsuo Enomoto.
7. Data preprocessing evaluation for web log mining: reconstruction of activities of a web visitor [O]. Munk Michal, Kapusta Jozef, Švec Peter. 2010.
