首页> 外文OA文献 >Evidence Summary: Music Information Seeking Behaviour Poses Unique Challenges for the Design of Information Retrieval Systems
【2h】

Evidence Summary: Music Information Seeking Behaviour Poses Unique Challenges for the Design of Information Retrieval Systems

机译:证据摘要:音乐信息搜寻行为对信息检索系统的设计提出了独特的挑战

摘要

Objective – To better understand music information seeking behaviour in a real life situation and to create a taxonomy relating to this behaviour to facilitate better comparison of music information retrieval studies in the future.ududDesign – Content analysis of natural language queries.ududSetting – Google Answers, a fee based online service.ududSubjects – 1,705 queries and their related answers and comments posted in the music category of the Google Answers website before April 27, 2005.ududMethods – A total of 2,208 queries were retrieved from the music category on the Google Answers service. Google Answers was a fee based service in which users posted questions and indicated what they were willing to pay to have them answered. udThe queries selected for this study were posted prior to April 27, 2005, over a year before the service was discontinued completely. Of the 2208 queries taken from the site, only 1,705 were classified as relevant to the question of music information seeking by the researcher. The off-topic queries were not included in the study.ududEach of the 1,705 queries was coded according to the needs expressed by the user and the information provided to assist researchers in answering the question. The initial coding framework used by the researcher was informed by previous studies of music information retrieval to facilitate comparison, but was expanded and revised to reflect the evidence itself. ududOnly the questions themselves were subjected to this iterative coding process. The answers provided by the Google Answer researchers and online comments posted by other users were examined by the author, but not coded for inclusion in the study.udUser needs in the questions were coded for their form and topic. Each question was assigned at least one form and one topic. Form refers to the type of question being asked and consisted of the following 10 categories: identification, location, verification, recommendation, evaluation, ready reference, reproduction, description, research, and other. Reproduction in this context is defined as “questions asking for text” and referred most often to questions looking for song lyrics, while evaluation typically meant the user was seeking reviews of works (p. 1029). Sixteen question topics were outlined in the coding framework. They included lyrics, translation, meaning (i.e., of lyrics), score, work, version, recording (e.g., where is an album available for purchase), related work, genre, artist, publisher, instrument, statistics, background (e.g. definitions), resource (i.e. sources of music information) and other.ududThe questions were also coded for their features or the information provided by the user. The final coding framework outlined 57 features, some of which were further subdivided by additional attributes. For example, a feature with attributes was title. The researcher further clarified the attribute of title by indicating whether the user mentioned the title of a musical work, recording, printed material or related work in their question. More than one feature could appear in a user query.ududMain Results – Overall, the most common questions posted on the Google Answers service relating to music involved identifying works or artists, finding recordings, or retrieving lyrics. The most popular query forms were identification (43.8%), location (33.3%), and reproduction (10.9%). The most common topics were work (49.1%), artist (36.4%), recording (16.7%), and lyrics (10.4%). The most common features provided by users in their posted questions were person name (53%), title (50.9%), date (45.6%), genre (37.2%), role (33.8%), and lyric (27.6%). The person name usually referred to an artist’s name (in 95.6% of cases) and title most often referred to the title of a musical work. Another feature that appeared in 25.6% of queries was place reference, almost half of which referred to the place where the user encountered the music they were enquiring about. While the coding framework eventually encompassed 57 different features, a small number of features dominated, with seven features used in over 25% of the queries posted and 33 features appearing in less than 10%. The seven most common features were person name, title, date, genre, role, lyric, and place reference.ududLee categorized most of the queries as “known-item searches,” even though at times users provided incorrect information and many were looking for information about the musical item but not the item itself (p. 1035). Other interesting features identified by the author were the presence of “dormant searches,” long standing questions a user had about a musical item, sometimes for years, which were reawakened by hearing the song again or other events (p. 1037). Multiple versions of musical works and the provision of information gleaned third hand by users were also identified as complicating factors in correctly meeting musical information needs.ududConclusion – While certain types of questions dominated among music queries posted on the Google Answers service, there were a wide variety of music information needs expressed by users. In some cases, the features provided by the user as clues to answering the query were very personal, and related to the context in which they encountered the work or the mood a particular work or artist evoked. Such circumstances are not currently or adequately covered by existing bibliographic record standards, which focus on qualities inherent in the music itself. The author suggests that user context should play a greater role in the testing and development of music information retrieval systems, although the instability and variability of this type of information is acknowledged. In some cases this context could apply to other works (film, television, etc.) in which a musical work is featured. Another potential implication for music information retrieval system development is a need to re-evaluate the terminology employed in testing to ensure that it is the language most often employed by users. For example, the 128 different terms used in this study to describe how a musical item made the user feel did not significantly overlap with terms employed in a previous music information retrieval task involving mood classification conducted through MIREX, the Music Information Retrieval Evaluation Exchange, in 2007. The author also argues that while most current music information retrieval testing is task-specific – e.g., how can a user search for a particular work by humming a few bars or searching for a work based on its genre, in real life, users come to their search with information that is not neatly parsed into separate tasks. The study affirms a need for systems that can combine tasks and/or consolidate the results of separate tasks for users.
机译:目标–更​​好地了解现实生活中的音乐信息搜索行为,并创建与该行为有关的分类法,以便将来更好地比较音乐信息检索研究。 ud udDesign –自然语言查询的内容分析。 ud udSetting –收费的在线服务Google Answers。 ud udSubjects – 2005年4月27日之前在Google Answers网站的音乐类别中发布的1,705条查询及其相关答案和评论。 ud udMethods –总计2,208查询是从Google解答服务的音乐类别中检索到的。 Google解答是一项收费服务,用户可以在其中发布问题并指出愿意为回答这些问题而支付的费用。 ud为这项研究选择的查询是在2005年4月27日之前发布的,也就是在该服务完全终止之前的一年。从该站点进行的2208个查询中,只有1705个被分类为与研究人员寻求的音乐信息有关。偏离主题的查询未包含在研究中。 ud ud根据用户表达的需求和所提供的信息来帮助研究人员回答问题,对1,705条查询中的每条进行了编码。研究人员最初使用的编码框架是在先前音乐信息检索研究的基础上提供信息的,以便于进行比较,但后来对其进行了扩展和修订以反映证据本身。 ud ud仅问题本身受此迭代编码过程的影响。作者对Google解答研究人员提供的答案和其他用户发布的在线评论进行了审查,但未编码为包含在研究中。 ud对用户需求的形式和主题进行了编码。每个问题至少分配了一种形式和一个主题。表格是指要提出的问题类型,它由以下10个类别组成:标识,位置,验证,推荐,评估,参考,复制,描述,研究及其他。在这种情况下,复制被定义为“询问文本的问题”,并且最常指代寻找歌曲歌词的问题,而评估通常意味着用户正在寻求作品的评论(第1029页)。编码框架中概述了十六个问题主题。它们包括歌词,翻译,含义(即歌词),乐谱,作品,版本,唱片(例如,可以在哪里购买专辑),相关作品,类型,艺术家,发行者,乐器,统计数据,背景(例如定义) ),资源(即音乐信息的来源)和其他。 ud ud问题的代码也针对其功能或用户提供的信息进行了编码。最终的编码框架概述了57个功能,其中一些功能进一步被附加属性细分。例如,具有属性的功能是标题。研究人员通过指示用户是否在其问题中提到音乐作品,唱片,印刷材料或相关作品的标题来进一步明确标题的属性。用户查询中可能会出现多个功能。 ud ud主要结果-总体而言,在Google解答服务上发布的与音乐有关的最常见问题涉及识别作品或艺术家,查找录音或检索歌词。最受欢迎的查询形式是识别(43.8%),位置(33.3%)和再现(10.9%)。最常见的主题是作品(49.1%),艺术家(36.4%),唱片(16.7%)和歌词(10.4%)。用户在发布的问题中提供的最常见功能是姓名(53%),职务(50.9%),日期(45.6%),体裁(37.2%),角色(33.8%)和歌词(27.6%)。人名通常是指艺术家的名字(占案件的95.6%),标题通常是指音乐作品的标题。 25.6%的查询中出现的另一个功能是位置参考,其中几乎有一半是用户遇到他们正在查询的音乐的位置。虽然编码框架最终包含了57个不同的功能,但少数功能占主导地位,超过25%的查询中使用了7个功能,而少于10%的查询中包含了33个功能。 ud udLee将七个查询归类为“已知项搜索”,尽管有时用户提供的信息不正确,并且许多信息不正确,但最常见的七个功能是人名,标题,日期,体裁,角色,歌词和位置参考。正在寻找有关音乐项目的信息,而不是该项目本身的信息(第1035页)。作者确定的其他有趣功能是“休眠搜索”的存在,这是用户长期以来对音乐项目的疑问,有时甚至长达数年之久。,通过再次听到歌曲或其他事件将其唤醒(第1037页)。音乐作品的多种版本以及用户收集的第三手信息也被认为是正确满足音乐信息需求的复杂因素。 ud ud结论–尽管某些类型的问题在Google问答服务上发布的音乐查询中占主导地位,用户表达了各种各样的音乐信息需求。在某些情况下,用户提供的作为回答查询线索的功能非常个人化,并且与他们遇到作品的环境或特定作品或艺术家引起的心情有关。现有书目记录标准当前或未充分涵盖这种情况,该标准着重于音乐本身固有的质量。作者建议用户上下文应在音乐信息检索系统的测试和开发中发挥更大的作用,尽管这种类型的信息的不稳定性和可变性已得到认可。在某些情况下,此上下文可以应用于以音乐作品为特色的其他作品(电影,电视等)。音乐信息检索系统开发的另一个潜在含义是需要重新评估测试中使用的术语,以确保它是用户最常使用的语言。例如,本研究中用于描述音乐项目如何使用户感觉到的128个不同术语与之前通过MIREX(音乐信息检索评估交易所)进行的涉及情绪分类的先前音乐信息检索任务中使用的术语没有明显重叠。 2007年。作者还指出,尽管当前大多数音乐信息检索测试都是针对特定任务的-例如,用户如何通过哼唱一些小节或根据其流派搜索作品来搜索特定作品,但在现实生活中,用户进行搜索时会使用没有整齐地解析为单独任务的信息。该研究肯定了对于可以组合任务和/或为用户合并单独任务的结果的系统的需求。

著录项

  • 作者

    Merkley Cari;

  • 作者单位
  • 年度 2010
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号