IEEE International Conference on Software Maintenance

Automatic classification of software related microblogs

Abstract

Millions of people, including those in the software engineering communities, have turned to microblogging services such as Twitter to disseminate information quickly. Past studies by Treude et al., Storey, and Yuan et al. have shown that these microblogs hold a wealth of interesting information. However, microblogs also contain a large amount of noisy content that is less relevant to developers engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatically classifying microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and from the titles of any URLs they mention. These features are then used to learn a discriminative model that classifies microblogs as relevant or irrelevant. We show that our trained model can achieve promising classification performance.
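The pipeline described in the abstract (features from microblog text and URL titles, then a discriminative classifier) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not name the feature representation or the learning algorithm, so the sketch substitutes bag-of-words counts and a simple perceptron. The function names, the `title:` prefix, and the toy posts are all hypothetical and not taken from the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9#+]+", text.lower())


def extract_features(post_text, url_titles=()):
    """Bag-of-words counts from the post body and from linked-page titles.

    Title tokens get a 'title:' prefix (an assumption of this sketch) so the
    model can weight them separately from body tokens.
    """
    feats = Counter(tokenize(post_text))
    for title in url_titles:
        feats.update("title:" + tok for tok in tokenize(title))
    feats["__bias__"] = 1
    return feats


def score(weights, feats):
    """Linear score: dot product of the weight vector and the features."""
    return sum(weights.get(f, 0.0) * v for f, v in feats.items())


def train_perceptron(examples, epochs=10):
    """Perceptron stand-in for the unspecified discriminative model.

    examples: list of (features, label), label +1 = relevant, -1 = irrelevant.
    """
    weights = {}
    for _ in range(epochs):
        for feats, label in examples:
            if label * score(weights, feats) <= 0:  # misclassified: update
                for f, v in feats.items():
                    weights[f] = weights.get(f, 0.0) + label * v
    return weights


def predict(weights, feats):
    return "relevant" if score(weights, feats) > 0 else "irrelevant"


# Hypothetical toy data: (post text, titles of linked pages, label).
TRAIN = [
    ("Fixed a null pointer bug in our parser", ["Commit: handle empty input"], 1),
    ("New release of the build tool adds incremental compilation", ["Release notes"], 1),
    ("Refactoring the test suite today", [], 1),
    ("Great coffee this morning", [], -1),
    ("Watching the football game tonight", ["Match preview"], -1),
    ("Happy birthday to my sister", [], -1),
]

examples = [(extract_features(text, titles), label) for text, titles, label in TRAIN]
weights = train_perceptron(examples)

print(predict(weights, extract_features("Found a bug in the parser")))
print(predict(weights, extract_features("Coffee and football tonight")))
```

Keeping title tokens in their own feature namespace is one plausible way to let a linear model treat a word in a linked page's title differently from the same word in the post body; whether the original study did this is not stated in the abstract.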