Journal: The Electronic Library

Retrieval of bibliographic records using Apache Lucene


Abstract

Purpose - The aim of the research is to model and implement a software component for the retrieval of bibliographic records using the Apache Lucene retrieval engine.

Design/methodology/approach - Object-oriented methodology is used for modeling and implementing the bibliographic record retrieval engine. Modeling is carried out in a CASE tool that supports the Unified Modeling Language (UML 2.0), while the implementation uses the Java programming language and open source components.

Findings - The result is a software component for the retrieval of bibliographic records that is independent of the bibliographic format used in cataloging. It offers great flexibility in configuring search types without the need to change the software implementation.

Research limitations/implications - One constraint of this system relates to searching linking entry fields. The UNIMARC format defines fields used to link the item being cataloged to another bibliographic item, so those fields may contain other fields, which can be termed secondary fields. In the proposed solution, secondary fields are treated like all other fields, and no information is kept about whether a search term belongs to a secondary or a regular field.

Practical implications - The proposed solution is integrated into version 4 of the BISIS library information system, which is in use at university, public and special libraries. Introducing this version improves system performance and the flexibility of the indexing process, and at the same time lets librarians perform sophisticated and effective retrieval of bibliographic records.

Originality/value - The contribution of this work is the design of a customizable record retrieval component. It is configured by means of an XML document that specifies mapping rules between subfields of the bibliographic record format and search types. Using XML makes it possible to add new mapping rules without additional programming. In addition, great attention has been paid to the indexing of subfields that contain punctuation marks with special semantic meaning for librarians, and to transliteration between Cyrillic and Latin scripts. The originality of this work also lies in using the Apache Lucene search engine, which facilitates building highly flexible and efficient retrieval systems.
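
The XML-driven mapping between subfields and search types can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration: the element and attribute names (`mappings`, `searchType`, `subfield`), the search-type names (`TI`, `AU`) and the sample record are hypothetical and do not reproduce the BISIS configuration format. The sketch is written in Python for brevity, whereas the component described in the abstract is implemented in Java on top of Apache Lucene; the Lucene indexing step itself is not shown.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical mapping document: each search type lists the
# (field, subfield) pairs whose content should be indexed under it.
MAPPING_XML = """<mappings>
  <searchType name="TI">
    <subfield field="200" code="a"/>
  </searchType>
  <searchType name="AU">
    <subfield field="700" code="a"/>
    <subfield field="701" code="a"/>
  </searchType>
</mappings>"""


def load_mapping(xml_text):
    """Parse the XML into {(field, subfield_code): [search type names]}."""
    mapping = defaultdict(list)
    root = ET.fromstring(xml_text)
    for search_type in root.iter("searchType"):
        for sf in search_type.iter("subfield"):
            key = (sf.get("field"), sf.get("code"))
            mapping[key].append(search_type.get("name"))
    return dict(mapping)


def index_terms(record, mapping):
    """Group subfield values by search type.

    `record` is a list of (field, subfield_code, value) triples.
    Subfields without a mapping rule are not indexed.
    """
    terms = defaultdict(list)
    for field, code, value in record:
        for search_type in mapping.get((field, code), []):
            terms[search_type].append(value)
    return dict(terms)


mapping = load_mapping(MAPPING_XML)

# Illustrative record, not taken from the paper.
record = [
    ("200", "a", "Retrieval of bibliographic records"),
    ("700", "a", "Example Author"),
    ("215", "a", "120 p."),  # no rule for 215$a, so it is skipped
]
terms = index_terms(record, mapping)
print(terms)
```

Under these assumptions, supporting a new search type only requires adding another `searchType` element to the XML document; the parsing and grouping code stays unchanged, which is the kind of flexibility the abstract attributes to the XML configuration.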
