...
首页> 外文期刊>OASIcs : OpenAccess Series in Informatics >Learning a Better Motif Index: Toward Automated Motif Extraction
【24h】

Learning a Better Motif Index: Toward Automated Motif Extraction

机译:学习更好的母题索引:实现自动母题提取

获取原文
           

摘要

Motifs are distinctive recurring elements found in folklore, and are used by folklorists to categorize and find tales across cultures and track the genetic relationships of tales over time. Motifs have significance beyond folklore as communicative devices found in news, literature, press releases, and propaganda that concisely imply a large constellation of culturally-relevant information. Until now, folklorists have only extracted motifs from narratives manually, and the conceptual structure of motifs has not been formally laid out. In this short paper we propose that it is possible to automate the extraction of both existing and new motifs from narratives using supervised learning techniques and thereby possible to learn a computational model of how folklorists determine motifs. Automatic extraction would enable the construction of a truly comprehensive motif index, which does not yet exist, as well as the automatic detection of motifs in cultural materials, opening up a new world of narrative information for analysis by anyone interested in narrative and culture. We outline an experimental design, and report on our efforts to produce a structured form of Thompson's motif index, as well as a development annotation of motifs in a small collection of Russian folklore. We propose several initial computational, supervised approaches, and describe several possible metrics of success. We describe lessons learned and difficulties encountered so far, and outline our plan going forward.
机译:主题是在民间传说中发现的与众不同的重复元素,民俗学家使用这些主题对各种文化的故事进行分类和查找,并随着时间的推移跟踪故事的遗传关系。主题具有超越民间文学艺术的意义,作为在新闻,文学,新闻稿和宣传中发现的交流手段,简而言之意味着大量与文化相关的信息。到目前为止,民俗学家只是从叙事中手动提取主题,而主题的概念结构尚未正式布局。在这篇简短的论文中,我们建议有可能使用监督学习技术从叙事中自动提取现有和新的主题,从而有可能学习民俗学家如何确定主题的计算模型。自动提取将能够构建尚不存在的真正全面的主题索引,并能够自动检测文化材料中的主题,从而开辟了叙事信息的新世界,可供对叙事和文化感兴趣的任何人进行分析。我们概述了一项实验性设计,并报告了我们为制作汤普森主题索引的结构化形式所做的努力,以及在少量俄罗斯民间文学艺术中对主题的发展注解。我们提出了几种初始的计算,监督方法,并描述了几种可能的成功指标。我们描述了到目前为止的经验教训和遇到的困难,并概述了我们的计划。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号