首页> 外文会议>AAAI Conference on Artificial Intelligence >Document Type Classification in Online Digital Libraries
【24h】

Document Type Classification in Online Digital Libraries

机译:在线数字图书馆中的文档类型分类

获取原文

摘要

Online digital libraries make it easier for researchers to search for scientific information. They have been proven as powerful resources in many data mining, machine learning and information retrieval applications that require high-quality data. The quality of the data highly depends on the accuracy of classifiers that identify the types of documents that are crawled from the Web, e.g., as research papers, slides, books, etc., for appropriate indexing. These classifiers in turn depend on the choice of the feature representation. We propose novel features that result in high-accuracy classifiers for document type classification. Experimental results on several datasets show that our classifiers outperform models that are employed in current systems.
机译:在线数字图书馆使研究人员更容易搜索科学信息。他们已被证明是许多数据挖掘,机器学习和信息检索应用中的强大资源,需要高质量数据。数据的质量高度取决于识别从网络爬出的文档类型的分类器的准确性,例如,作为适当索引的研究论文,幻灯片,书籍等。这些分类器依次依赖于要素表示的选择。我们提出了新的功能,从而导致文档类型分类的高精度分类器。若干数据集上的实验结果表明我们的分类器优于当前系统中使用的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号