Kernel based part of speech tagger for Kannada

机译：基于内核的Kannada语音标记器部分

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The proposed paper presents the development of a part-of-speech tagger for Kannada language that can be used for analyzing and annotating Kannada texts. POS tagging is considered as one of the basic tool and component necessary for many Natural Language Processing (NLP) applications like speech recognition, natural language parsing, information retrieval and information extraction of a given language. In order to alleviate problems for Kannada language, we proposed a new machine learning POS tagger approach. Identifying the ambiguities in Kannada lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. We have developed our own tagset which consist of 30 tags and built a part-of-speech Tagger for Kannada Language using Support Vector Machine (SVM). A corpus of texts, extracted from Kannada news papers and books, is manually morphologically analyzed and tagged using our developed tagset. The performance of the system is evaluated and we found that the result obtained was more efficient and accurate compared with earlier methods for Kannada POS tagging.

机译：拟议论文介绍了用于Kannada语言的词性标记器的开发，该标记器可用于分析和注释Kannada文本。 POS标记被认为是许多自然语言处理（NLP）应用程序（例如语音识别，自然语言解析，信息检索和给定语言的信息提取）所必需的基本工具和组件之一。为了减轻卡纳达语的问题，我们提出了一种新的机器学习POS标记器方法。在开发高效，准确的POS Tagger的过程中，识别Kannada词汇项目中的歧义是具有挑战性的目标。我们已经开发了自己的标签集，该标签集包含30个标签，并使用支持向量机（SVM）为卡纳达语语言构建了词性Tagger。从卡纳达语报纸和书籍中提取的文本语料库使用我们开发的标记集进行了手工形态分析和标记。对系统的性能进行了评估，我们发现与Kannada POS标记的早期方法相比，所获得的结果更加有效和准确。

著录项

来源
《Proceedings of the Ninth International Conference on Machine Learning and Cybernetics》|2010年|2139-2144|共6页
会议地点
作者
Antony P. J; Soman K. P;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动推理、机器学习;
关键词
Classification; Kannada; NLP; POS Tagger; Support Vector Machine;

机译：分类; Kannada; NLP; POS Tagger;支持向量机;

相似文献

外文文献
中文文献
专利

1. Fast and Accurate Parts of Speech Tagging for Kannada-Telugu Pair [J] . Chandramma, Piyush Kumar Pareek International Journal of Applied Engineering Research . 2018,第10aPta5期

机译：Kannada-Telugu对的快速准确的言语标记部分
2. Segment-level probabilistic sequence kernel and segment-level pyramid match kernel based extreme learning machine for classification of varying length patterns of speech [J] . Shikha Gupta, Ahmed Karanath, Kansul Mahrifa, International journal of speech technology . 2019,第1期

机译：基于段级概率序列核和段级金字塔匹配核的极限学习机，用于语音不同长度模式的分类
3. Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling [J] . G. Thimmaraja Yadava, H. S. Jayanna International journal of speech technology . 2020,第1期

机译：通过背景噪声消除和替代声学建模增强了自动Kannada语音识别系统
4. Kernel based part of speech tagger for Kannada [C] . Antony P. J, Soman K. P International Conference on Machine Learning and Cybernetics . 2010

机译：基于Kernel的kannada的言语标签
5. A neural network speech tagger based on rough set attribute reduction. [D] . Zhong, Xin. 2014

机译：基于粗糙集属性约简的神经网络语音标记器。
6. Biologically-Inspired Spike-Based Automatic Speech Recognition of Isolated Digits Over a Reproducing Kernel Hilbert Space [O] . Kan Li, José C. Príncipe 2018

机译：仿生希尔伯特空间上基于数字启发的基于穗的孤立数字自动语音识别
7. Parts of Speech Tagging for Kannada [O] . Swaroop L R, Rakshit Gowda G S, Shriram Hegde, 2019

机译：kannada的讲话标记部分

Kernel based part of speech tagger for Kannada

摘要

著录项

相似文献

相关主题

期刊订阅