您现在的位置: 首页> 研究主题> NLP

NLP

NLP的相关文献在1997年到2023年内共计462篇,主要集中在自动化技术、计算机技术、教育、财政、金融 等领域,其中期刊论文257篇、专利文献205篇;相关期刊209种,包括中小学心理健康教育、社会心理科学、新西部(下旬刊)等; NLP的相关文献由1100位作者贡献,包括詹启敏、Fahd N.Al-Wesabi、温永选等。

NLP—发文量

期刊论文>

论文:257 占比:55.63%

专利文献>

论文:205 占比:44.37%

总计:462篇

NLP—发文趋势图

NLP

-研究学者

  • 詹启敏
  • Fahd N.Al-Wesabi
  • 温永选
  • 付修锋
  • 任俊珍
  • 凌波
  • 夏飞
  • 孔东泉
  • 宋咏梅
  • 张胜
  • 期刊论文
  • 专利文献

搜索

排序:

年份

    • YeolWoo Sung; Dae Seung Park; Cheong Ghil Kim
    • 摘要: The entry into a hyper-connected society increases the generalization of communication using SNS.Therefore,research to analyze big data accumulated in SNS and extractmeaningful information is being conducted in various fields.In particular,with the recent development of Deep Learning,the performance is rapidly improving by applying it to the field of Natural Language Processing,which is a language understanding technology to obtain accurate contextual information.In this paper,when a chatbot system is applied to the healthcare domain for counseling about diseases,the performance of NLP integrated withmachine learning for the accurate classification ofmedical subjects from text-based health counseling data becomes important.Among the various algorithms,the performance of Bidirectional Encoder Representations from Transformers was compared with other algorithms of CNN,RNN,LSTM,and GRU.For this purpose,the health counseling data of Naver Q&A service were crawled as a dataset.KoBERT was used to classify medical subjects according to symptoms and the accuracy of classification results was measured.The simulation results show that KoBERTmodel performed high performance by more than 5%and close to 18%as large as the smallest.
    • 尕藏才让
    • 摘要: 文章主要阐述了自然语言处理的基础技术。首先,文章介绍了藏汉两种文字自然语言处理(NLP)技术的发展历程;其次,文章比较了藏汉两种语言的字,词、句和篇章等层面的信息处理方法的差异性,试图探索和挖掘跨语种的自然语言处理之间的语法特点,从而选取符合语法特征的自然语言处理方法,做到因语施策。
    • 刘邵宏
    • 摘要: 文章以IT高水平专业群建设为例,通过自然语言处理(NLP)对IT产业岗位需求进行了大数据分析,依次采用DBSCAN聚类和LDA概率主题模型,分析得到了4类岗位簇与5个技能模块关系矩阵。文章采用LTP依存句法提取岗位簇技能、知识、素质模块,进一步为厘清高水平专业群组群逻辑、完善人才培养模式、增强职业技术教育适应性提供了技术支持。
    • 李思雨; 程芃森; 刘嘉勇
    • 摘要: 深度学习技术的发展使得基于深度神经网络的方法成为自然语言处理(Natural Language Processing,NLP)领域的一种新解决思路。虽然神经网络技术能够有效提升生成文本的质量,但是生成文本的内容很容易偏离作者原本要表达的语义。笔者按照人们的写作习惯,即先构思出各部分的主题再进行写作的方式,提出基于交叉项编码的关键词主题控制文本生成模型。与其他模型相比,该模型生成的句子不仅在双语互译质量评估(Bilingual Evaluation Understudy,BLEU)中的得分更高,Correlation值也要高于其他模型。
    • 苏天; 龚炳江
    • 摘要: BERT是谷歌AI团队近年来新发布的自然语言预训练模型,在11种不同的NLP测试中创出最佳成绩,被认为是NLP领域中里程碑式的进步,因此利用BERT进行文本情感分析是一个很热门的研究方向,该文中水利舆情分析主要是对水利新闻进行情感分析。该文对基于词典、机器学习和深度学习的情感分类技术进行了分析,并提出了基于完整句分割的BERT-BiLSTM水利新闻文本情感分类模型。该课题可以为水利行业从业人员和其他领域的情感分类研究提供较高的指导意义。
    • 陈小丽; 谢才夫
    • 摘要: 针对某公司基于风险防控的内控合规体系数字化、智能化程度不足的问题,探索利用NLP技术,构建面向项目的内控合规的风险指标体系,搭建风险评级智能化模型,并通过机器学习逐步完善及应用,最终推动了某公司内控合规体系清晰、透明、高效运作,实现内控合规风险数字化智能化管控。
    • 付修锋; 贾张涛; 杨铁湃; 安恒; 金玉川; 耿宏伟
    • 摘要: 在软件功能开发过程中,会存在待开发功能在市面上已存在或者有相似的情况,为了节省开发成本,程序员普遍会选择用代码复用的方式解决项目"冷启动"问题,这也降低了开发成本。换言之,代码复用已经逐渐变成行业所接受的开发模式。但其中仍然存在诸多问题和安全隐患,如恶意软件代码、缺失许可认证等,所以代码安全监测是软件安全发展的重要手段。文章基于复用代码监测,从代码之间语义特征角度出发,设计了一种精准的袪码特征提取算法,并在此基础上实现二进制代码复用检测。实验结果表明,二进制工业软件溯源方法可以完成代码复用检测工作,并且在文件级、函数级维度都体现出良好的准确性。
    • 洪季芳
    • 摘要: 近年来,Transformer已经成为自然语言处理之中一个新兴的热门研究领域,越来越多的研究人员加入这项技术研究中,提出了很多具有一定突破的方法。将对相关技术进行总结,介绍原理和特征提取方法,此外,针对新兴研究方向进行介绍和展望。
    • WooHyun Park; Nawab Muhammad Faseeh Qureshi; Dong Ryeol Shin
    • 摘要: Spam mail classification considered complex and error-prone task in the distributed computing environment.There are various available spam mail classification approaches such as the naive Bayesian classifier,logistic regression and support vector machine and decision tree,recursive neural network,and long short-term memory algorithms.However,they do not consider the document when analyzing spam mail content.These approaches use the bagof-words method,which analyzes a large amount of text data and classifies features with the help of term frequency-inverse document frequency.Because there are many words in a document,these approaches consume a massive amount of resources and become infeasible when performing classification on multiple associated mail documents together.Thus,spam mail is not classified fully,and these approaches remain with loopholes.Thus,we propose a term frequency topic inverse document frequency model that considers the meaning of text data in a larger semantic unit by applying weights based on the document’s topic.Moreover,the proposed approach reduces the scarcity problem through a frequency topic-inverse document frequency in singular value decomposition model.Our proposed approach also reduces the dimensionality,which ultimately increases the strength of document classification.Experimental evaluations show that the proposed approach classifies spam mail documents with higher accuracy using individual document-independent processing computation.Comparative evaluations show that the proposed approach performs better than the logistic regression model in the distributed computing environment,with higher document word frequencies of 97.05%,99.17%and 96.59%.
    • Venkateswara Rao Kota; Shyamala Devi Munisamy
    • 摘要: Purpose-Neural network(NN)-based deep learning(DL)approach is considered for sentiment analysis(SA)by incorporating convolutional neural network(CNN),bi-directional long short-term memory(Bi-LSTM)and attention methods.Unlike the conventional supervised machine learning natural language processing algorithms,the authors have used unsupervised deep learning algorithms.Design/methodology/approach-The method presented for sentiment analysis is designed using CNN,Bi-LSTM and the attention mechanism.Word2vec word embedding is used for natural language processing(NLP).The discussed approach is designed for sentence-level SA which consists of one embedding layer,two convolutional layers with max-pooling,oneLSTMlayer and two fully connected(FC)layers.Overall the system training time is 30 min.Findings-The method performance is analyzed using metrics like precision,recall,F1 score,and accuracy.CNN is helped to reduce the complexity and Bi-LSTM is helped to process the long sequence input text.Originality/value-The attention mechanism is adopted to decide the significance of every hidden state and give a weighted sum of all the features fed as input.
  • 查看更多

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号