首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Golden Trail: Retrieving the Data History that Matters from a Comprehensive Provenance Repository
  • 本地全文:下载
  • 作者:Paolo Missier ; Bertram Ludäscher ; Saumen Dey
  • 期刊名称:International Journal of Digital Curation
  • 印刷版ISSN:1746-8256
  • 出版年度:2012
  • 卷号:7
  • 期号:1
  • 页码:139-150
  • DOI:10.2218/ijdc.v7i1.211
  • 语种:English
  • 出版社:University of Edinburgh
  • 摘要:Experimental science can be thought of as the exploration of a large research space, in search of a few valuable results. While it is this “Golden Data” that gets published, the history of the exploration is often as valuable to the scientists as some of its outcomes. We envision an e-research infrastructure that is capable of systematically and automatically recording such history – an assumption that holds today for a number of workflow management systems routinely used in e-science. In keeping with our gold rush metaphor, the provenance of a valuable result is a “Golden Trail”. Logically, this represents a detailed account of how the Golden Data was arrived at, and technically it is a sub-graph in the much larger graph of provenance traces that collectively tell the story of the entire research (or of some of it). In this paper we describe a model and architecture for a repository dedicated to storing provenance traces and selectively retrieving Golden Trails from it. As traces from multiple experiments over long periods of time are accommodated, the trails may be sub-graphs of one trace, or they may be the logical representation of a virtual experiment obtained by joining together traces that share common data. The project has been carried out within the Provenance Working Group of the Data Observation Network for Earth (DataONE) NSF project. Ultimately, our longer-term plan is to integrate the provenance repository into the data preservation architecture currently being developed by DataONE.
国家哲学社会科学文献中心版权所有