Extracting Knowledge Entities from Sci-Tech Intelligence Resources Based on BiLSTM and Conditional Random Field

Weizhi LIAO; Mingtong HUANG; Pan MA; Yu WANG

首页> 外文期刊>IEICE transactions on information and systems >Extracting Knowledge Entities from Sci-Tech Intelligence Resources Based on BiLSTM and Conditional Random Field

【24h】

Extracting Knowledge Entities from Sci-Tech Intelligence Resources Based on BiLSTM and Conditional Random Field

机译：基于Bilstm和条件随机字段从SCI-Tech Intelligence资源中提取知识实体

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

There are many knowledge entities in sci-tech intelligence resources. Extracting these knowledge entities is of great importance for building knowledge networks, exploring the relationship between knowledge, and optimizing search engines. Many existing methods, which are mainly based on rules and traditional machine learning, require significant human involvement, but still suffer from unsatisfactory extraction accuracy. This paper proposes a novel approach for knowledge entity extraction based on BiLSTM and conditional random field (CRF).A BiLSTM neural network to obtain the context information of sentences, and CRF is then employed to integrate global label information to achieve optimal labels. This approach does not require the manual construction of features, and outperforms conventional methods. In the experiments presented in this paper, the titles and abstracts of 20,000 items in the existing sci-tech literature are processed, of which 50,243 items are used to build benchmark datasets. Based on these datasets, comparative experiments are conducted to evaluate the effectiveness of the proposed approach. Knowledge entities are extracted and corresponding knowledge networks are established with a further elaboration on the correlation of two different types of knowledge entities. The proposed research has the potential to improve the quality of sci-tech information services.

机译：SCI-Tech Intelligence资源中有许多知识实体。提取这些知识实体对于建立知识网络，探索知识与优化搜索引擎之间的关系非常重要。许多现有的方法主要基于规则和传统机器学习，需要大量的人类参与，但仍然遭受不令人满意的提取精度。本文提出了一种基于Bilstm和条件随机字段（CRF）的知识实体提取的新方法.A Bilstm神经网络以获得句子的上下文信息，然后采用CRF集成全局标签信息以实现最佳标签。这种方法不需要手动构建功能，并且优于传统方法。在本文提出的实验中，处理了现有的SCI-Tech文献中的20,000个项目的标题和摘要，其中50,243项用于构建基准数据集。基于这些数据集，进行了比较实验，以评估所提出的方法的有效性。提取知识实体，并建立了相应的知识网络，其进一步阐述了两种不同类型的知识实体的相关性。拟议的研究有可能提高科技信息服务的质量。

著录项

来源
《IEICE transactions on information and systems》 |2021年第8期|共8页
作者
Weizhi LIAO; Mingtong HUANG; Pan MA; Yu WANG;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
sci-tech intelligence resourcesknowledge entitysequence labelingBiLSTM-CRF;

机译：SCI-Tech Intelligence Resource Knowledge EntitySequence LabelingBilstm-CRF;

相似文献

外文文献
中文文献
专利

1. Extracting hyponymy of domain entity using Cascaded Conditional Random Fields [J] . Xiaojun Ma, Jianyi Guo, Zhengtao Yu, Pattern recognition and image analysis: advances in mathematical theory and applications in the USSR . 2017,第3期

机译：使用级联条件随机字段提取域实体的ShopMy
2. Extracting Hyponymy of Domain Entity Using Cascaded Conditional Random Fields1 [J] . Xiaojun Ma, Jianyi Guo, Zhengtao Yu, Pattern recognition and image analysis: advances in mathematical theory and applications in the USSR . 2017,第3期

机译：使用级联条件随机字段提取域实体的开槽
3. Chunk Parsing and Entity Relation Extracting to Chinese Text by Using Conditional Random Fields Model [J] . Junhua Wu, Longxia Liu Journal of Intelligent Learning Systems and Applications . 2010,第3期

机译：利用条件随机场模型对中文文本进行分块解析和实体关系提取
4. An Approach of Chunk Parsing and Entity Relation Extracting to Chinese Based on Conditional Random Fields Model [C] . Wu Jun-hua, Zhou Jing International Conference on Intelligent Systems Design and Applications . 2008

机译：基于条件随机字段模型的Chunk解析与实体关系的方法
5. Conditional Random Fields With Lasso and Its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection [D] . Liu, Xiyuan. 2019

机译：基于真菌感染表达水平的带套索条件随机场及其在大麦基因分类中的应用
6. SBLC: a hybrid model for disease named entity recognition based on semantic bidirectional LSTMs and conditional random fields [O] . Kai Xu, Zhanfan Zhou, Tao Gong, 2018

机译：SBLC：基于语义双向LSTM和条件随机场的疾病命名实体识别混合模型
7. Chunk Parsing and Entity Relation Extracting to Chinese Text by Using Conditional Random Fields Model [O] . Junhua Wu, Longxia Liu 2010

机译：利用条件随机场模型对中文文本进行分块解析和实体关系提取

Extracting Knowledge Entities from Sci-Tech Intelligence Resources Based on BiLSTM and Conditional Random Field

摘要

著录项

相似文献

相关主题

期刊订阅