Tibetan Word Segmentation Based on Word-Position Tagging

机译：基于词位标注的藏文分词

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The best advantage of Tibetan word segmentation based on word-position is to reduce segmentation errors for unknown words. In this article authors upgrade usual 4-tag set to 6-tag set to fit in with the features of Tibetan characters, using CRF as tagging model to train and test corpus data, then building post processing modules to revise the result data. The experimental result shows that this method achieves a good performance and deserves further study, including expanding the corpus and optimizing the tag set and feature templates.

机译：基于词位置的藏文分词的最大优点是减少了未知词的分词错误。在本文中，作者将常用的4标记集升级为6标记集以适应藏文字符的特征，使用CRF作为标记模型来训练和测试语料库数据，然后构建后处理模块以修改结果数据。实验结果表明，该方法具有良好的性能，值得进一步研究，包括扩展语料库，优化标签集和特征模板。

著录项

来源
《International Conference on Asian Language Processing》|2013年|239-242|共4页
会议地点
作者
Kang Caijun; Jiang Di; Long Congjun;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
CRF; Tibetan; tagging model; word-position;

机译：CRF;藏语;标记模型;词位置;

相似文献

外文文献
中文文献
专利

1. Tibetan Word Segmentation and POS Tagging Research Based on Knowledge Feedback [J] . Wei Bao, Luobsang Karten Journal of Residuals Science & Technology . 2016,第8期

机译：基于知识反馈的藏语分词与POS标记研究
2. A Unified Character-Based Tagging Framework for Chinese Word Segmentation [J] . HAI ZHAO, CHANG-NING HUANG, MU LI, ACM transactions on Asian language information processing . 2010,第2期

机译：统一的基于字符的中文分词标记框架
3. A Neural Joint Model with BERT for Burmese Syllable Segmentation, Word Segmentation, and POS Tagging [J] . Mao Cunli, Man Zhibo, Yu Zhengtao, ACM transactions on Asian and low-resource language information processing . 2021,第4期

机译：具有伯尔马斯音节分割，词分割和POS标记的伯特的神经关节模型
4. Tibetan Word Segmentation Based on Word-Position Tagging [C] . Kang Caijun, Jiang Di, Long Congjun International Conference on Asian Language Processing . 2013

机译：基于单词位置标记的西藏词分割
5. Image segmentation and pigment mapping of cultural heritage based on spectral imaging. [D] . Zhao, Yonghui. 2008

机译：基于光谱成像的文化遗产图像分割和色素绘图。
6. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [O] . Ying Xiong, Zhongmin Wang, Dehuan Jiang, 2019

机译：用于临床文本的细粒度中文分词和词性标注语料库
7. A Method to Do Lexical Analysis while Word-position Tagging [O] . 黄小斌, 余悦蒙 2012

机译：词位标注时进行词法分析的方法

Tibetan Word Segmentation Based on Word-Position Tagging

摘要

著录项

相似文献

相关主题

期刊订阅