...
首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Automatic Generation of the Draft Procuratorial Suggestions Based on an Extractive Summarization Method: BERTSLCA
【24h】

Automatic Generation of the Draft Procuratorial Suggestions Based on an Extractive Summarization Method: BERTSLCA

机译:基于提取摘要方法自动生成检察机关草案:Bertslca

获取原文
           

摘要

The automatic generation of the draft procuratorial suggestions is to extract the description of illegal facts, administrative omission, description of laws and regulations, and other information from the case documents. Previously, the existing deep learning methods mainly focus on context-free word embeddings when addressing legal domain-speci?c extractive summarization tasks, which cannot get a better semantic understanding of the text and in turn leads to an adverse summarization performance. To this end, we propose a novel deep contextualized embeddings-based method BERTSLCA to conduct the extractive summarization task. The model is mainly based on the variant of BERT called BERTSUM. Firstly, the input document is fed into BERTSUM to get sentence-level embeddings. Then, we design an extracting architecture to catch the long dependency between sentences utilizing the Bi-Long Short-Term Memory (Bi-LSTM) unit, and at the end of the architecture, three cascaded convolution kernels with different sizes are designed to extract the relationships between adjacent sentences. Last, we introduce an attention mechanism to strengthen the ability to distinguish the importance of different sentences. To the best of our knowledge, this is the first work to use the pretrained language model for extractive summarization tasks in the field of Chinese judicial litigation. Experimental results on public interest litigation data and CAIL 2020 dataset all demonstrate that the proposed method achieves competitive performance.
机译:自动生成的检察建议草案是从案件文档中提取违法事实,行政不作为,法律法规的说明等信息的描述。此前,已有深学习方法解决法律域SPECI 2 C采掘总结的任务,不能得到进而导致文字和更好的语义理解到不利的汇总性能时,主要集中在上下文字的嵌入。为此,我们提出了一个新颖的深基于情境的嵌入法BERTSLCA进行采掘摘要任务。该模型主要是基于BERT称为BERTSUM的变种。首先,输入文档送进BERTSUM得到语句级的嵌入。然后,我们设计了一个提取架构赶上利用双长短期内存(双LSTM)单元句子之间的长期依赖,并在建筑的结束,不同尺寸的三个级联卷积核被设计成提取相邻句子之间的关系。最后,我们引入注意的机制,以加强区分不同的句子的重要性的能力。据我们所知,这是使用采掘汇总任务预训练的语言模型在中国的司法诉讼领域的第一部作品。关于公益诉讼的数据和CAIL 2020数据集的实验结果都表明,该方法实现竞争力的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号