Journal: Computer Science & Information Technology

Arabic Tweets Categorization Based on Rough Set Theory


Abstract

Twitter is a popular microblogging service where users create status messages (called "tweets"). These tweets sometimes express opinions about different topics and are presented to the user in chronological order. This format of presentation is useful to the user, since the latest tweets are rich in recent news, which is generally more interesting than tweets about an event that occurred a long time ago. However, merely presenting tweets in chronological order may be too cumbersome for the user, especially if he has many followers. Therefore, there is a need to separate the tweets into different categories and then present the categories to the user. Nowadays, Text Categorization (TC) is becoming more significant, especially for the Arabic language, which is one of the most complex languages.

In this paper, in order to improve the accuracy of tweet categorization, a system based on Rough Set Theory is proposed for enriching the document's representation. The effectiveness of our system was evaluated and compared in terms of the F-measure of the Naïve Bayesian classifier and the Support Vector Machine classifier.
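The F-measure used for the evaluation can be illustrated with a minimal sketch. The abstract does not say whether scores are averaged per category (macro) or pooled (micro), so the macro average below is an assumption, and the function names and the category labels are hypothetical, invented only for illustration.

```python
from collections import Counter


def f_measure_per_category(gold, predicted):
    """Return {category: F1} for two parallel lists of labels."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1          # correct assignment to category g
        else:
            fp[p] += 1          # p was assigned but is wrong
            fn[g] += 1          # g was missed
    scores = {}
    for c in set(gold) | set(predicted):
        assigned = tp[c] + fp[c]
        relevant = tp[c] + fn[c]
        precision = tp[c] / assigned if assigned else 0.0
        recall = tp[c] / relevant if relevant else 0.0
        total = precision + recall
        # F1 is the harmonic mean of precision and recall
        scores[c] = 2 * precision * recall / total if total else 0.0
    return scores


def macro_f_measure(gold, predicted):
    """Unweighted mean of the per-category F1 scores (assumed averaging)."""
    scores = f_measure_per_category(gold, predicted)
    return sum(scores.values()) / len(scores)


# Hypothetical tweet categories, not taken from the paper
gold = ["sport", "sport", "politics", "politics", "economy"]
pred = ["sport", "politics", "politics", "politics", "economy"]
per_category = f_measure_per_category(gold, pred)
macro = macro_f_measure(gold, pred)
```

The same function can score the output of any classifier, so two classifiers such as Naïve Bayes and a Support Vector Machine can be compared on identical gold labels.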