Conference on Empirical Methods in Natural Language Processing

Entity Enhanced BERT Pre-training for Chinese NER



Abstract

Character-level BERT pre-trained on Chinese text lacks lexicon information, which has been shown to be effective for Chinese NER. To integrate the lexicon into pre-trained LMs for Chinese NER, we investigate a semi-supervised entity-enhanced BERT pre-training method. In particular, we first extract an entity lexicon from the relevant raw text using a new-word discovery method. We then integrate the entity information into BERT using a Char-Entity-Transformer, which augments self-attention with a combination of character and entity representations. In addition, an entity classification task helps inject the entity information into the model parameters during pre-training. The pre-trained models are used for NER fine-tuning. Experiments on a news dataset and two long-text NER datasets annotated by ourselves show that our method is highly effective and achieves the best results.
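The sketch below illustrates the general idea of augmenting character-level self-attention with entity information: characters that match an entry in the extracted entity lexicon carry an entity id, and the keys and values mix the character representation with the matched entity's embedding. The gating combination, the module name `CharEntitySelfAttention`, and the single attention head are illustrative assumptions, not the paper's exact Char-Entity-Transformer formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CharEntitySelfAttention(nn.Module):
    """Single-head self-attention over characters where keys/values for
    characters covered by a lexicon-matched entity mix in that entity's
    embedding. Illustrative sketch only, not the paper's exact model."""

    def __init__(self, hidden_size: int, num_entities: int):
        super().__init__()
        # Index 0 is reserved for "no lexicon match" (an assumption).
        self.entity_emb = nn.Embedding(num_entities, hidden_size)
        self.q = nn.Linear(hidden_size, hidden_size)
        self.k = nn.Linear(hidden_size, hidden_size)
        self.v = nn.Linear(hidden_size, hidden_size)
        # Assumed gating scheme for the character/entity combination.
        self.gate = nn.Linear(2 * hidden_size, 1)

    def forward(self, char_hidden, entity_ids):
        # char_hidden: (batch, seq_len, hidden); entity_ids: (batch, seq_len)
        ent = self.entity_emb(entity_ids)                  # entity repr per character
        g = torch.sigmoid(self.gate(torch.cat([char_hidden, ent], dim=-1)))
        mixed = g * char_hidden + (1.0 - g) * ent          # character/entity mixture
        q = self.q(char_hidden)                            # queries stay character-level
        k, v = self.k(mixed), self.v(mixed)                # keys/values see entity info
        scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
        return F.softmax(scores, dim=-1) @ v


# Example: a 6-character sentence where characters 2-4 match lexicon entity id 7.
attn = CharEntitySelfAttention(hidden_size=768, num_entities=10000)
chars = torch.randn(1, 6, 768)
entity_ids = torch.tensor([[0, 0, 7, 7, 7, 0]])            # 0 = no lexicon match
out = attn(chars, entity_ids)                              # (1, 6, 768)
```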
