Taming the metadata mess

机译：驯服元数据混乱

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The rapid growth of scientific data shows no sign of abating. This growth has led to a new problem: with so much scientific data at hand, stored in thousands of datasets, how can scientists find the datasets most relevant to their research interests? We have addressed this problem by adapting Information Retrieval techniques, developed for searching text documents, into the world of (primarily numeric) scientific data. We propose an approach that uses a blend of automated and “semi-curated” methods to extract metadata from large archives of scientific data, then evaluates ranked searches over this metadata. We describe a challenge identified during an implementation of our approach: the large and expanding list of environmental variables captured by the archive do not match the list of environmental variables in the minds of the scientists. We briefly characterize the problem and describe our initial thoughts on resolving it.

机译：科学数据的迅速增长没有丝毫减弱的迹象。这种增长导致了一个新问题：手头拥有如此之多的科学数据并存储在成千上万个数据集中，科学家如何才能找到与其研究兴趣最相关的数据集？我们通过将为检索文本文档而开发的信息检索技术改编为（主要是数字的）科学数据世界来解决此问题。我们提出了一种方法，该方法使用自动化和“半策划”方法的混合从大型科学数据档案中提取元数据，然后评估对该元数据进行的排名搜索。我们描述了在实施方法过程中发现的挑战：档案馆捕获的庞大且不断扩大的环境变量列表与科学家们认为的环境变量列表不匹配。我们简要地描述了该问题，并描述了解决该问题的最初想法。

著录项

来源
《2013 IEEE 29th International Conference on data Engineering Workshops》|2013年|286-289|共4页
会议地点 Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU);Brisbane(AU)
作者
Megler V.M.;
展开▼
作者单位

Department of Computer Science, Portland State University, Oregon, USAc;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. BRANDIS AND ENCRYPTION: THE METADATA MESS ALL OVER AGAIN [J] . Sam Varghese Exchange . 2017,第JUNa14期

机译：品牌和加密：元数据再次出现
2. Clearing the metadata mess [J] . Neal Romanek TVB Europe . 2014,第jula期

机译：清除元数据混乱
3. Metadata mega mess in Google Scholar [J] . Peter Jacso On-line review . 2010,第1期

机译：Google学术搜索中的元数据大混乱
4. Taming the Metadata Mess [C] . V.M. Megler, Supervised by David Maier International Conference on Data Engineering Workshops . 2013

机译：驯服元数据混乱
5. A Semantic Metadata Enrichment Software Ecosystem (SMESE): Its Prototypes for Digital Libraries, Metadata Enrichments and Assisted Literature Reviews. [D] . Brisebois, Ronald. 2017

机译：语义元数据丰富软件生态系统（SMESE）：其数字图书馆原型，元数据丰富和辅助文献评论。
6. They mess with me I mess with them: Understanding physical aggression in rural girls and boys from methamphetamine-involved families [O] . Wendy Haight, Jane Marshall, Sydney Hans, -1

机译：他们混淆了我我混淆了他们：了解来自甲基苯丙胺的家庭的农村女孩和男孩的身体侵略
7. Taming the Metadata Mess [O] . Megler Veronika Margaret 2013

机译：驯服metadata mess

Taming the metadata mess

摘要

著录项

相似文献

相关主题

期刊订阅