IEEE International Conference on Software Analysis, Evolution and Reengineering

MulCode: A Multi-task Learning Approach for Source Code Understanding



Abstract

Recent years have witnessed a significant rise in Deep Learning (DL) techniques applied to source code. Researchers exploit DL for a multitude of tasks and achieve impressive results. However, most tasks are explored separately, resulting in a lack of generalization of the solutions. In this work, we propose MulCode, a multi-task learning approach for source code understanding that learns a unified representation space across tasks, using the pre-trained BERT model for the token sequence and the Tree-LSTM model for abstract syntax trees. Furthermore, we integrate the two source code views into a hybrid representation via an attention mechanism and set learnable uncertainty parameters to adjust the relationships among tasks. We train and evaluate MulCode on three downstream tasks: comment classification, author attribution, and duplicate function detection. In all three tasks, MulCode outperforms the state-of-the-art techniques. Moreover, experiments on three unseen tasks demonstrate the generalization ability of MulCode compared with state-of-the-art embedding methods.
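The "learnable uncertainty parameters" mentioned in the abstract are commonly realized as homoscedastic-uncertainty task weighting, where each task's loss is scaled by a learned log-variance term. The sketch below illustrates that weighting scheme in plain Python; the function name and the numeric loss values are illustrative assumptions, not taken from the paper.

```python
import math

def uncertainty_weighted_loss(task_losses, log_vars):
    """Combine per-task losses with learnable uncertainty weights.

    Each task i contributes exp(-s_i) * L_i + s_i, where s_i is a
    learnable log-variance parameter: a larger s_i down-weights a
    noisy or hard task while the +s_i term prevents s_i from growing
    without bound. (Illustrative sketch, not the paper's exact code.)
    """
    total = 0.0
    for loss, s in zip(task_losses, log_vars):
        total += math.exp(-s) * loss + s
    return total

# Hypothetical losses for the three downstream tasks (comment
# classification, author attribution, duplicate function detection);
# in training, log_vars would be optimized jointly with the model.
losses = [0.8, 1.5, 0.4]
log_vars = [0.0, 0.5, -0.3]
print(uncertainty_weighted_loss(losses, log_vars))
```

With all log-variances fixed at zero the combined loss reduces to a plain sum, so the scheme strictly generalizes uniform task weighting.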


