首页> 外文期刊>Natural language engineering >Identifying off-topic student essays without topic-specific training data
【24h】

Identifying off-topic student essays without topic-specific training data

机译:在没有特定主题的培训数据的情况下,识别出题外的学生论文

获取原文
获取原文并翻译 | 示例
           

摘要

Educational assessment applications, as well as other natural-language interfaces, need some mechanism for validating user responses. If the input provided to the system is infelicitous or uncooperative, the proper response may be to simply reject it, to route it to a bin for special processing, or to ask the user to modify the input. If problematic user input is instead handled as if it were the system's normal input, this may degrade users' confidence in the software, or suggest ways in which they might try to "game" the system. Our specific task in this domain is the identification of student essays which are "off-topic", or not written to the test question topic. Identification of off-topic essays is of great importance for the commercial essay evaluation system Criterion~(SM). The previous methods used for this task required 200-300 human scored essays for training purposes. However, there are situations in which no essays are available for training, such as when users (teachers) wish to spontaneously write a new topic for their students. For these kinds of cases, we need a system that works reliably without training data. This paper describes an algorithm that detects when a student's essay is off-topic without requiring a set of topic-specific essays for training. This new system is comparable in performance to previous models which require topic-specific essays for training, and provides more detailed information about the way in which an essay diverges from the requested essay topic.
机译:教育评估应用程序以及其他自然语言界面需要某种机制来验证用户的响应。如果提供给系统的输入是微不足道的或不合作的,则正确的响应可能是简单地拒绝它,将其路由到垃圾箱以进行特殊处理或要求用户修改输入。如果相反地将有问题的用户输入当作系统的正常输入来处理,则这可能会降低用户对软件的信心,或者建议他们尝试“游戏”系统的方式。在此领域中,我们的具体任务是识别“离题”或未写到测试题的学生论文。题外论文的识别对于商业论文评估系统Criterion〜(SM)至关重要。用于该任务的先前方法需要200-300个人评分的论文以进行培训。但是,在某些情况下,没有可供培训的论文,例如当用户(教师)希望为他们的学生自发编写新主题时。对于此类情况,我们需要一种无需训练数据即可可靠运行的系统。本文介绍了一种算法,该算法可检测学生的论文何时偏离主题,而无需训练一组特定主题的论文。这个新系统的性能与以前的模型相当,后者需要特定主题的论文进行训练,并且提供了有关论文与所请求的论文主题不同的方式的详细信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号