Tackling the Low-resource Challenge for Canonical Segmentation

机译：解决规范分割的低资源挑战

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Canonical morphological segmentation consists of dividing words into their standardized morphemes. Here, we are interested in approaches for the task when training data is limited. We compare model performance in a simulated low-resource setting for the high-resource languages German, English, and Indonesian to experiments on new datasets for the truly low-resource languages Popoluca and Tepehua. We explore two new models for the task, borrowing from the closely related area of morphological generation: an LSTM pointer-generator and a sequence-to-sequence model with hard monotonic attention trained with imitation learning. We find that, in the low-resource setting, the novel approaches outperform existing ones on all languages by up to 11.4% accuracy. However, while accuracy in emulated low-resource scenarios is over 50% for all languages, for the truly low-resource languages Popoluca and Tepehua, our best model only obtains 37.4% and 28.4% accuracy, respectively. Thus, we conclude that canonical segmentation is still a challenging task for low-resource languages.

机译：典型形态分割包括把话到他们的标准化语素。在这里，我们感兴趣的是该任务的方法时，训练数据是有限的。我们比较了高资源德语，英语一模拟低资源设置模型的性能，以及印尼对的真正低资源语言Popoluca和Tepehua新的数据集实验。我们探索两个新型号的任务，从形态产生密切相关的领域借用：一个LSTM指针发生器和序列到序列模型与模仿学习刻苦训练单调的关注。我们发现，在低资源设置，新的方法跑赢大盘高达11.4％的准确率在所有语言现有的。然而，虽然精度模拟低资源场景是所有语言的超过50％，对于真正的低资源语言Popoluca和Tepehua，我们最好的模型只取得精度37.4％和28.4％，分别。因此，我们得出结论，规范分割仍是低资源语言一项艰巨的任务。

著录项

来源
《Conference on Empirical Methods in Natural Language Processing》|2020年|5237-5250|共14页
会议地点
作者
Manuel Mager; OEzlem Cetinoglu; Katharina Kann;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. The Global One Health Paradigm: Challenges and Opportunities for Tackling Infectious Diseases at the Human, Animal, and Environment Interface in Low-Resource Settings [J] . Wondwossen A. Gebreyes, Jean Dupouy-Camet, Melanie J. Newport, PLOS Neglected Tropical Diseases . 2014,第11期

机译：全球单一健康范例：在资源匮乏的环境中应对人，动物和环境界面上的传染病的挑战和机遇
2. The Global One Health Paradigm: Challenges and Opportunities for Tackling Infectious Diseases at the Human, Animal, and Environment Interface in Low-Resource Settings [J] . Wondwossen A. Gebreyes, Jean Dupouy-Camet, Melanie J. Newport, PLOS Neglected Tropical Diseases . 2014,第11期

机译：全球单一健康范例：在资源匮乏的环境中应对人，动物和环境界面上的传染病的挑战和机遇
3. Tackling the Fallout From Chronic Kidney Disease of Unknown Etiology: Why We Need to Focus on Providing Peritoneal Dialysis in Rural, Low-Resource Settings [J] . Nishanthe Nanayakkara, A.W.M. Wazil, Lishanthe Gunerathne, Kidney International Reports . 2017,第1期

机译：解决病因不明的慢性肾脏疾病的后果：为什么我们需要集中精力在农村，资源贫乏地区提供腹膜透析
4. Well Intervention Challenges in Mega-Reach Wells with Coiled Tubing and the Application of the Latest Technologies in Tackling These Challenges, Saudi Arabia [C] . M. Dhufairi, J. Arukhe, T. Elsherif Coiled Tubing and Well Intervention Conference and Exhibition . 2013

机译：梅尔加井的良好干预挑战，带有盘绕管道的井和应用最新技术在沙特阿拉伯解决这些挑战中的应用
5. Tackling Challenges Related to Thick Electrodes in Platinum Group Metal Free Polymer Electrolyte Fuel Cells [D] . Dunsmore, Lisa. 2021

机译：在铂族金属无聚合物电解质电解质燃料电池中处理与厚电极相关的挑战
6. The Global One Health Paradigm: Challenges and Opportunities for Tackling Infectious Diseases at the Human Animal and Environment Interface in Low-Resource Settings [O] . Wondwossen A. Gebreyes, Jean Dupouy-Camet, Melanie J. Newport, 2014

机译：全球单一健康范例：在资源匮乏的环境中应对人动物和环境界面上的传染病的挑战和机遇
7. The global one health paradigm: challenges and opportunities for tackling infectious diseases at the human, animal, and environment interface in low-resource settings. [O] . Wondwossen A Gebreyes, Jean Dupouy-Camet, Melanie J Newport, 2014

机译：全球一种健康模式：在资源匮乏的环境中应对人类，动物和环境界面的传染病的挑战和机遇。

Tackling the Low-resource Challenge for Canonical Segmentation

摘要

著录项

相似文献

相关主题

期刊订阅