首页> 外文会议>Machine learning and data mining in pattern recognition >Unsupervised Grammar Inference Using the Minimum Description Length Principle
【24h】

Unsupervised Grammar Inference Using the Minimum Description Length Principle

机译:使用最小描述长度原理的无监督语法推理

获取原文
获取原文并翻译 | 示例

摘要

Context Free Grammars (CFGs) are widely used in programming language descriptions, natural language processing, compilers, and other areas of software engineering where there is a need for describing the syntactic structures of programs. Grammar inference (GI) is the induction of CFGs from sample programs and is a challenging problem. We describe an unsupervised GI approach which uses simplicity as the criterion for directing the inference process and beam search for moving from a complex to a simpler grammar. We use several operators to modify a grammar and use the Minimum Description Length (MDL) Principle to favor simple and compact grammars. The effectiveness of this approach is shown by a case study of a domain specific language. The experimental results show that an accurate grammar can be inferred in a reasonable amount of time.
机译:上下文无关文法(CFG)广泛用于编程语言描述,自然语言处理,编译器以及需要描述程序语法结构的其他软件工程领域。语法推断(GI)是来自示例程序的CFG的归纳,是一个具有挑战性的问题。我们描述了一种无监督的GI方法,该方法使用简单性作为指导推理过程的准则,并使用波束搜索从复杂的语法过渡到更简单的语法。我们使用多个运算符来修改语法,并使用最小描述长度(MDL)原理来支持简单而紧凑的语法。特定领域语言的案例研究表明了这种方法的有效性。实验结果表明,可以在合理的时间内推断出准确的语法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号