【24h】

Research on Information Extraction in Judicial Field

机译:司法领域信息提取研究

获取原文

摘要

In the era of information explosion, how to effectively identify the named entities and the relationships between them from the massive judicial data is a key step to realize automatic trial, and it is also the focus of this paper. The traditional way is to use the pipeline method to identify named entities first, and then extract relationships, which will lead to the loss of related information between tasks and ignore the overlapping relationships between entities. For this reason, this paper designs a joint extraction algorithm for entities and relations in the judicial field. It uses the BiLSTM network model based on the BERT pre-training language algorithm and combines it with the attention mechanism, realizes joint learning through parameter sharing, and makes full use of the connections between tasks to optimize the results. In order to verify the effectiveness of this method, this paper constructs a small judicial data set to verify its generalization ability. Experimental results show that the model proposed in this paper can extract entities and relationships without manually defining complex features, and the average F value can reach 0.7.
机译:在信息爆炸的时代,如何有效地识别命名实体以及来自大规模司法数据之间的关系是实现自动试验的关键步骤,也是本文的重点。传统方式是首先使用管道方法来识别命名实体,然后提取关系,然后提取关系,这将导致任务之间的相关信息丢失并忽略实体之间的重叠关系。出于这个原因,本文设计了司法领域实体和关系的联合提取算法。它使用基于BERT预培训语言算法的Bilstm网络模型,并将其与注意机制相结合,实现通过参数共享的联合学习,并充分利用任务之间的连接来优化结果。为了验证这种方法的有效性,本文构建了一个小司法数据集,以验证其泛化能力。实验结果表明,本文提出的模型可以在不手动定义复杂特征的情况下提取实体和关系,并且平均f值可以达到0.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号