The Role of Provenance Management in Accelerating the Rate of Astronomical Research

@article{Berriman2010TheRO,
  title={The Role of Provenance Management in Accelerating the Rate of Astronomical Research},
  author={G. Bruce Berriman and Ewa Deelman},
  journal={ArXiv},
  year={2010},
  volume={abs/1005.3358}
}
The availability of vast quantities of data through electronic archives has transformed astronomical research. It has also enabled the creation of new products, models and simulations, often from distributed input data and models, that are themselves made electronically available. These products will only provide maximal long-term value to astronomers when accompanied by records of their provenance; that is, records of the data and processes used in the creation of such products. We use the… 
2 Citations

Figures and Tables from this paper

Cloud computing in the age of data-intensive science
TLDR
To examine the cost and efficiency of generating new products, new image sets were created from data uploaded to the Amazon Elastic Compute 2 (EC2) cloud and its performance was compared with that of the Abe cluster at the National Center for Supercomputing Applications.
ReputationNet: Reputation-Based Service Recommendation for e-Science
TLDR
ReputationNet is proposed as an enhancement of ServiceMap, to incorporate the often-ignored reputation aspects of services/workflows and their publishers, in order to offer better service and workflow recommendations.

References

SHOWING 1-9 OF 9 REFERENCES
The Open Provenance Model
TLDR
The first Provenance Challenge was set up in order to provide a forum for the community to understand the capabilities of provenance systems and the expressiveness of their provenance representations, and was followed by the second challenge, aimed at establishing inter-operability of systems, by exchanging provenance information.
Provenance Tracking in an Earth Science Data Processing System
TLDR
Science Data Processing Systems should capture, archive, and distribute provenance information of all externally received data and algorithms, as well as describing all internal processes used for data transformation.
Scientific Data Management - Challenges, Technology, and Deployment
TLDR
This book provides a comprehensive understanding of the latest techniques for managing data during scientific exploration processes, from data generation to data analysis.
Oceanographic Data Provenance Tracking with the Shore Side Data System
TLDR
The provenance tracking aspects of the MBARI Shore Side Data System are described and the lessons learned from its implementation in an operational environment are described.
Pipeline-centric provenance model
TLDR
A new provenance model which is tailored to a class of workflow-based applications, and the benefits in terms of storage needed by the approach when applied to an astronomy application is evaluated.
Pegasus: A framework for mapping complex scientific workflows onto distributed systems
TLDR
The results of improving application performance through workflow restructuring which clusters multiple tasks in a workflow into single entities are presented.
Pipeline-Centric Provenance Mode.l" Paper accepted for publication at Supercomputing 09
  • Pipeline-Centric Provenance Mode.l" Paper accepted for publication at Supercomputing 09
  • 2009
Scientific Data Management: Challenges, Existing Technology, and Deployment. Arie Shoshani and Doron Rotem, Editor
  • Scientific Data Management: Challenges, Existing Technology, and Deployment. Arie Shoshani and Doron Rotem, Editor
  • 2009