...
首页> 外文期刊>Data & Knowledge Engineering >A general framework for subjective information extraction from unstructured English text
【24h】

A general framework for subjective information extraction from unstructured English text

机译:从非结构化英文文本中提取主观信息的通用框架

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present an information extraction (IE) strategy for handling subjective information from unstructured text. The presented methodology is general: it can be useful in many real-life applications that could potentially benefit from an automatic IE system that makes human-like decisions. We test our methodology in the sphere of company news evaluation with respect to the potential effect of the news on the company's stock prices. The described general framework comprises four sequential processing steps: part-of-speech tagging, syntactic parsing, relation generation, and criteria evaluation. The first two steps perform generic NLP tasks, while the last two phases are application-specific and require a thorough understanding of the application domain. We describe each stage and illustrate the flow of the modus operandi. We keep up with the company news evaluation example throughout the paper. Due to the inherent subjectivity of the envisaged problem, results cannot be categorically justified. However, comparing the system's evaluation of company news to our own, the results were very encouraging.
机译:在本文中,我们提出了一种信息提取(IE)策略,用于处理非结构化文本的主观信息。所介绍的方法是通用的:它在许多现实生活中很有用,可能会从自动IE系统中做出类似人的决定而受益。我们针对新闻对公司股价的潜在影响,在公司新闻评估领域中测试了我们的方法。所描述的通用框架包括四个顺序处理步骤:词性标记,语法分析,关系生成和标准评估。前两个步骤执行常规的NLP任务,而后两个阶段是特定于应用程序的,需要对应用程序域有透彻的了解。我们描述了每个阶段并说明了操作方法的流程。在整篇文章中,我们都跟上公司新闻评估示例。由于所设想问题的固有主观性,因此无法明确证明结果是正确的。但是,将系统对公司新闻的评估与我们自己的评估相比,结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号