Enhanced protein domain discovery by using language modeling techniques from speech recognition

机译：通过使用语音识别中的语言建模技术来增强蛋白质结构域发现

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most modern speech recognition uses probabilistic models to interpret a sequence of sounds. Hidden Markov models, in particular, are used to recognize words. The same techniques have been adapted to find domains in protein sequences of amino acids. To increase word accuracy in speech recognition, language models are used to capture the information that certain word combinations are more likely than others, thus improving detection based on context. However, to date, these context techniques have not been applied to protein domain discovery. Here we show that the application of statistical language modeling methods can significantly enhance domain recognition in protein sequences. As an example, we discover an unannotated Tf_Otx Pfam domain on the cone rod homeobox protein, which suggests a possible mechanism for how the V242M mutation on this protein causes cone-rod dystrophy.

机译：大多数现代语音识别使用概率模型来解释声音序列。隐马尔可夫模型尤其用于识别单词。已采用相同的技术来发现氨基酸的蛋白质序列中的结构域。为了提高语音识别中的单词准确性，使用语言模型来捕获某些单词组合比其他单词更可能出现的信息，从而改善了基于上下文的检测。但是，迄今为止，这些上下文技术尚未应用于蛋白质结构域发现。在这里，我们表明统计语言建模方法的应用可以显着增强蛋白质序列中的域识别。例如，我们在锥杆同源盒蛋白上发现了一个未注释的Tf_Otx Pfam结构域，这提示了该蛋白上的V242M突变如何引起锥杆营养不良的可能机制。

著录项

期刊名称 Proceedings of the National Academy of Sciences of the United States of America
作者
Lachlan Coin; Alex Bateman; Richard Durbin;
展开▼
作者单位

展开▼
年(卷),期 2003(100),8
年度 2003
页码 4516–4520
总页数 5
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. 语言测试中的语音识别技术——在已英语为母语的环境中参加测试重要吗？ [J] . Marina DODIGOVIC 中国应用语言学：英文版 . 2015,第003期
2. Enhanced protein domain discovery by using language modeling techniques from speech recognition. [J] . Coin L, Bateman A, Durbin R Proceedings of the National Academy of Sciences of the United States of America . 2003,第8期

机译：通过使用语音识别中的语言建模技术来增强蛋白质结构域发现。
3. Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System [J] . S. SARASWATHI, T.V. GEETHA ACM transactions on Asian language information processing . 2007,第3期

机译：增强的基于词素的语言模型与不同的基于单词的语言模型的性能比较，以提高泰米尔语语音识别系统的性能
4. Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2018,第6期

机译：基于潜在词语言模型混合的领域自适应语音自动识别
5. Evaluation of smoothing techniques for language modeling in automatic filipino speech recognition [C] . Ang Federico M., Ancheta Juan Carlo Miguel C., Francia Karmela Mariz F., 2012 IEEE Region 10 Conference: sustainable development through humanitarian technology. . 2012

机译：自动菲律宾语音识别中的语言建模平滑技术评估
6. Evaluation of speech enhancement techniques for speaker recognition in noisy environments. [D] . El-Solh, Abdel-Aziz. 2006

机译：在嘈杂环境中评估语音增强技术以进行说话人识别。
7. Deep Learning Techniques for Speech Emotion Recognition from Databases to Models [O] . Babak Joze Abbaschian, Daniel Sierra-Sosa, Adel Elmaghraby 2021

机译：语音情感认可的深度学习技术从数据库到模型
8. Enhanced protein domain discovery by using language modeling techniques from speech recognition [O] . Coin, Lachlan, Bateman, Alex, Durbin, Richard 2003

机译：通过使用语音识别中的语言建模技术来增强蛋白质结构域发现
9. Research Into the Use of Speech Recognition Enhanced Microworlds in an Authorable Language Tutor [R] . Plott, B. , Hamilton, A. , Princen, E. , 1999

机译：语言识别中增强微观世界在一位专业语言导师中的应用研究

Enhanced protein domain discovery by using language modeling techniques from speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅