首页> 外文会议>ACM/IEEE-CS joint conference on digital libraries >A Hybrid Two-Stage Approach for Discipline-Independent Canonical Representation Extraction from References
【24h】

A Hybrid Two-Stage Approach for Discipline-Independent Canonical Representation Extraction from References

机译:从参考文献中独立于学科的规范表示的混合两阶段方法

获取原文

摘要

In education and research, references play a key role. However, extracting and parsing references are difficult problems. One concern is that there are many styles of references; hence, given a surface form, identifying what style was employed is problematic, especially in heterogeneous collections of theses and dissertations, which cover many fields and disciplines, and where different styles may be used even in the same publication. We address these problems by drawing upon suitable knowledge found in the WWW. In particular, we research a two-stage classifier approach, involving multi-class classification with respect to reference styles, and partially solve the problem of parsing surface representations of references. We describe empirical evidence for the effectiveness of our approach and plans for improvement of our methods.
机译:在教育和研究中,参考文献起着关键作用。但是,提取和解析引用是困难的问题。一个令人担忧的是,引用的样式很多。因此,给定一个表面形式,确定采用哪种样式是有问题的,尤其是在这些论文和论文的异构集合中,这些集合涵盖了许多领域和学科,并且即使在同一出版物中也可以使用不同的样式。我们通过利用WWW中的适当知识来解决这些问题。特别是,我们研究了一种两阶段分类器方法,该方法涉及针对参考样式的多类分类,并部分解决了解析参考的表面表示的问题。我们描述了我们方法的有效性的经验证据以及改进方法的计划。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号