Exploiting RFC2828 as a Domain Vocabulary for Identifying IT Security Literature

Abstract

The volume of published scientific literature available on the Internet has been increasing exponentially. Some of it reflects the latest achievements of specific research domains. In recent years, many projects aimed at mining online scientific literature have been funded, especially in biomedical research. Scientific literature covers most of the hot topics in a research field and has a very large domain-specific vocabulary. Exploiting domain knowledge and specialized vocabulary can dramatically improve the results of literature text processing. The purpose of this paper is to investigate the automatic identification and classification of IT security literature, so that IT security related papers can be retrieved from the Internet with high accuracy. RFC 2828 provides explanations and recommendations for the use of IT security terminology. In this paper, we evaluate the identification of IT security literature using RFC 2828 glossary-based feature selection combined with a TF/IDF scheme. Our experimental results show that this approach performs better than the common TF/IDF method.
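
The abstract does not spell out how the glossary-based feature selection is combined with TF/IDF, so the following is only a minimal sketch of one plausible reading: term weights are computed solely for words that appear in a glossary, and everything else is discarded. The `GLOSSARY` set is a tiny hypothetical stand-in for the RFC 2828 term list, the smoothed IDF formula is an assumed variant, and the names `glossary_tfidf` and `security_score` are illustrative rather than taken from the paper.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in for the RFC 2828 glossary: a few single-word
# security terms chosen for illustration only.
GLOSSARY = {"authentication", "cipher", "certificate", "firewall", "encryption"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def glossary_tfidf(docs, glossary):
    """Return one {term: weight} dict per document, keeping only glossary terms.

    Assumed weighting: tf = count / document length,
    idf = log((1 + N) / (1 + df)) + 1 (a smoothed variant).
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)

    # Document frequency, counted over glossary terms only.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens) & glossary)

    vectors = []
    for tokens in tokenized:
        counts = Counter(t for t in tokens if t in glossary)
        total = len(tokens) or 1  # avoid division by zero on empty documents
        vectors.append({
            term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def security_score(vector):
    """Sum of glossary-term weights; a crude relevance signal, not a classifier."""
    return sum(vector.values())


if __name__ == "__main__":
    docs = [
        "The cipher provides encryption and authentication.",
        "We study protein folding in yeast cells.",
    ]
    for doc, vec in zip(docs, glossary_tfidf(docs, GLOSSARY)):
        print(round(security_score(vec), 3), doc)
```

Restricting the vocabulary to glossary terms keeps the feature space small and domain-focused. A real system would also need to handle multi-word glossary entries and feed the resulting vectors to a trained classifier, neither of which is attempted here.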
