Import2vec: Learning Embeddings for Software Libraries

机译：Import2VEC：用于软件库的学习嵌入式

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider the problem of developing suitable learning representations (embeddings) for library packages that capture semantic similarity among libraries. Such representations are known to improve the performance of downstream learning tasks (e.g. classification) or applications such as contextual search and analogical reasoning. We apply word embedding techniques from natural language processing (NLP) to train embeddings for library packages ("library vectors"). Library vectors represent libraries by similar context of use as determined by import statements present in source code. Experimental results obtained from training such embeddings on three large open source software corpora reveals that library vectors capture semantically meaningful relationships among software libraries, such as the relationship between frameworks and their plug-ins and libraries commonly used together within ecosystems such as big data infrastructure projects (in Java), front-end and back-end web development frameworks (in JavaScript) and data science toolkits (in Python).

机译：我们考虑开发合适的学习表示（嵌入）的库包，用于捕获图书馆之间的语义相似性的库包。已知这些代表可以改善下游学习任务（例如分类）或诸如上下文搜索和类比推理的应用的性能。我们将嵌入技术从自然语言处理（NLP）应用于培训库包的嵌入式（“库向量”）。库向量代表库通过使用源代码中存在的导入语句确定的类似上下文来表示库。从训练中获得的实验结果在三个大型开源软件上进行了培训，揭示了图书馆向量捕获了软件库之间的语义有意义的关系，例如框架和他们的插件和库之间的关系，通常在大数据基础架构项目等生态系统中一起使用（在Java中），前端和后端Web开发框架（JavaScript）和数据科学工具包（在Python中）。

著录项

来源
《IEEE/ACM International Conference on Mining Software Repositories》|2019年|xxxiv 606 p. :|共11页
会议地点
作者
Bart Theeten; Frederik Vandeputte; Tom Van Cutsem;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类安全保密;
关键词
Big Data; C++ language; Java; learning (artificial intelligence); natural language processing; public domain software; software libraries;

机译：大数据;C ++语言;Java;学习（人工智能）;自然语言处理;公共领域软件;软件图书馆;

相似文献

外文文献
中文文献
专利

1. Complex instruction and software library mapping for embedded software using symbolic algebra [J] . Peymandoust A., Simunic T., De Micheli G. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems . 2003,第8期

机译：使用符号代数的嵌入式软件的复杂指令和软件库映射
2. The wisdom of embedding student assistants in library learning workflows: Focus on listening and learning [J] . Brett Bodemer College & research libraries news . 2016,第7期

机译：在图书馆学习工作流程中嵌入助教的智慧：专注于聆听和学习
3. University of Wollongong Library: Embedding Learning and Development as Part of Our Organisational DNA [J] . Donna Dee, Keith Brophy, Kristy Newton The international information & library review . 2020,第3期

机译：卧龙岗大学图书馆：将学习和开发嵌入为组织DNA的一部分
4. Import2vec: Learning Embeddings for Software Libraries [C] . Bart Theeten, Frederik Vandeputte, Tom Van Cutsem IEEE/ACM International Conference on Mining Software Repositories . 2019

机译：Import2vec：学习软件库的嵌入
5. Intrusion Detection: Embedded Software Machine Learning and Hardware Rules Based Co-designs [D] . Abdulhammed, Razan 2019

机译：入侵检测：基于嵌入式软件机器学习和硬件规则的协同设计
6. CRISPR library designer (CLD): software for multispecies design of single guide RNA libraries [O] . Florian Heigwer, Tianzuo Zhan, Marco Breinig, 2016

机译：CRISPR库设计器（CLD）：用于单向导RNA库的多物种设计的软件
7. Complex Instruction and Software Library Mapping for Embedded Software Using Symbolic Algebra [O] . Armita Peymandoust, Tajana Simunic, Giovanni De Micheli 2003

机译：使用符号代数的嵌入式软件的复杂指令和软件库映射
8. Quantitative Analysis of Embedded Software Using Game-Theoretic Learning [R] . Seshia, S. A., Rakhlin, A. 2009

机译：基于博弈论学习的嵌入式软件定量分析

Import2vec: Learning Embeddings for Software Libraries

摘要

著录项

相似文献

相关主题

期刊订阅