Codeswitched Sentence Creation using Dependency Parsing

Abstract

Codeswitching has become one of the most common phenomena among multilingual speakers worldwide, especially in countries like India, which has around 23 official languages and roughly 300 million bilingual speakers. The scarcity of Codeswitched data is a bottleneck for exploring this domain across Natural Language Processing (NLP) tasks. We thus present a novel algorithm that harnesses the syntactic structure of English grammar to develop grammatically sensible Codeswitched versions of English-Hindi, English-Marathi and English-Kannada data. Apart from largely preserving grammatical soundness, our methodology also guarantees abundant generation of data from a very small sample of given data. We use multiple datasets to showcase the capabilities of our algorithm, assess the quality of the generated Codeswitched data using qualitative metrics, and provide baseline results for a couple of NLP tasks.
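
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of using a dependency parse to pick switchable phrases. It is not the authors' method. Everything in it is an assumption made for illustration: the hand-written parse (no parser is run), the `Token` type, the `subtree` and `codeswitch` helpers, the choice of `obj` and `advmod` as switchable relations, and the tiny English-to-Hindi phrase table. The sketch replaces the contiguous subtree under a chosen dependent with its translation, so one sentence yields several variants.

```python
from dataclasses import dataclass


@dataclass
class Token:
    idx: int   # position in the sentence
    text: str  # surface form
    head: int  # index of the head token (-1 for the root)
    dep: str   # dependency relation to the head


# Hand-written parse of "She bought a new book yesterday" (assumed, not parser output).
SENTENCE = [
    Token(0, "She", 1, "nsubj"),
    Token(1, "bought", -1, "root"),
    Token(2, "a", 4, "det"),
    Token(3, "new", 4, "amod"),
    Token(4, "book", 1, "obj"),
    Token(5, "yesterday", 1, "advmod"),
]

# Hypothetical phrase table standing in for a translation step.
PHRASES = {"a new book": "एक नई किताब", "yesterday": "कल"}


def subtree(tokens, root_idx):
    """Return the sorted indices of root_idx and all of its descendants."""
    children = {}
    for tok in tokens:
        children.setdefault(tok.head, []).append(tok.idx)
    stack, seen = [root_idx], []
    while stack:
        i = stack.pop()
        seen.append(i)
        stack.extend(children.get(i, []))
    return sorted(seen)


def codeswitch(tokens, switch_deps, phrases):
    """Build one variant per switchable subtree that has a known translation."""
    variants = []
    for tok in tokens:
        if tok.dep not in switch_deps:
            continue
        span = subtree(tokens, tok.idx)
        # Only switch subtrees that form a contiguous phrase.
        if span != list(range(span[0], span[-1] + 1)):
            continue
        phrase = " ".join(tokens[i].text for i in span)
        if phrase not in phrases:
            continue
        words = [t.text for t in tokens]
        words[span[0]:span[-1] + 1] = [phrases[phrase]]
        variants.append(" ".join(words))
    return variants


variants = codeswitch(SENTENCE, {"obj", "advmod"}, PHRASES)
for v in variants:
    print(v)
# She bought एक नई किताब yesterday
# She bought a new book कल
```

Switching whole subtrees rather than single words keeps determiners and modifiers in the same language as their head noun, which is one simple way to keep the output grammatical.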

Bibliographic details

Source
IEEE International Conference on Semantic Computing | 2021 | pp. 124-129 | 6 pages
Authors
Dhruval Jain; Arun D Prabhu; Shubham Vatsal; Gopi Ramena; Naresh Purre
Original format
PDF
Keywords
Measurement; Conferences; Semantics; Syntactics; Natural language processing; Grammar; Task analysis

Similar literature

1. An ERP study of parsing and memory load in Japanese sentence processing - A comparison between left-corner parsing and the Dependency Locality Theory [J]. Shodai Uchida, Edson T. Miyamoto, Yuki Hirose. 電子情報通信学会技術研究報告. 思考と言語. Thought and Language. 2014, No. 176
2. Neural Dependency Parser for Tibetan Sentences [J]. An Bo, Long Congjun. ACM Transactions on Asian and Low-Resource Language Information Processing. 2021, No. 2
3. Exploring global sentence representation for graph-based dependency parsing using BLSTM-SCNN [J]. Si Nianwen, Wang Hengjun, Shan Yidong. Pattern Recognition Letters. 2018, No. APRa1
4. How to Train Dependency Parsers with Inexact Search for Joint Sentence Boundary Detection and Parsing of Entire Documents [C]. Anders Bjorkelund, Agnieszka Falenska, Wolfgang Seeker. Annual Meeting of the Association for Computational Linguistics. 2016
5. Towards Effective Domain Adaptation of Dependency Parsing [D]. Mukherjee, Atreyee. 2020
6. Acceptable Ungrammatical Sentences, Unacceptable Grammatical Sentences, and the Role of the Cognitive Parser [O]. Evelina Leivada, Marit Westergaard. 2020
7. Zero-shot Dependency Parsing with Pre-trained Multilingual Sentence Representations [O]. Ke Tran, Arianna Bisazza. 2019
