Metadata Management in the Taverna Workflow System

@article{Belhajjame2008MetadataMI,
  title={Metadata Management in the Taverna Workflow System},
  author={Khalid Belhajjame and Katy Wolstencroft and {\'O}scar Corcho and Thomas M. Oinn and Franck Tanoh and Alan R. Williams and Carole A. Goble},
  journal={2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID)},
  year={2008},
  pages={651-656}
}
There seems to be a general consensus on the crucial role metadata can play for enhancing the functionalities of scientific workflows systems, e.g., workflow and service discovery, composition and provenance browsing, among others. However, in most cases their management is under-specified, if not left unaddressed at all. A step in this direction, the main contribution of the work presented in this paper is an overview of metadata and their management in the Taverna workflow system. In Taverna… 

Figures from this paper

An Intermediate Data-driven Methodology for Scientific Workflow Management System to Support Reusability
TLDR
This thesis proposes an intermediate data management scheme for a Scientific Workflow Management System (SWfMS) and proposes an adaptive version of the technique by considering the states of tools in a SWfMS, which shows around 40% reusability for workflows.
Provenance Collection in Reservoir Management Workflow Environments
TLDR
This paper presents an approach that uses the workflow instances detection algorithm and the Open Provenance Model (OPM) for capturing provenance information from the logs for reservoir engineering.
Metadata Workflows Across Research Domains: Challenges and Opportunities for Supporting the DFC Cyberinfrastructure
TLDR
This poster presents research results from a survey studying metadata workflows, finding where are people (and automated processes) creating metadata in the data life cycle, and what could be done to improve the quality.
Semantics-enriched workflow creation and management system with an application to document image analysis and recognition
TLDR
How concepts of the Semantic Web can be used to augment workflow systems in order to assist researchers as well as non-expert users in creating valid and effective workflows is explored.
Semantic-Based Scientific Workflow Composition
TLDR
This chapter presents a novel semantic-based approach which features separation of concerns between data semantics and data representation, and between Activity Function (AF) and activity type (AT) (semantic description of workflow activities) and Activity Type (AT), which simplifies scientific workflow composition.
Computer-Assisted Scientific Workflow Design
TLDR
This work addresses the problem of data-intensive Scientific Workflow design by describing scientific experiments at a higher level of abstraction, emphasizing scientific concepts over technicalities, easing the separation of functional and non-functional concerns and leveraging domain knowledge.
A TAXONOMY ON TOOLS FOR SCIENTIFIC WORKFLOW MANAGEMENT SYSTEM
TLDR
There is as of now much concern in scientific domain to automation distributed experiments to automate distributed experiments.
Mechanisms for provenance collection in scientific workflow systems
TLDR
A set of design dimensions and conventions for provenance collection mechanisms in the context of working on scientific workflow systems are identified and defined and used in order to evaluate a number of existing provenancecollection mechanisms.
A Worklow Management Platform for Scientific Applications in Grid Environments
TLDR
PEGAF is presented, a workflow management platform for distributed systems targeted at scientific applications that provides features like an intuitive way to describe workflows and flexible fault tolerance support, while integrating both workflow orchestration and meta-scheduling.
Abstract Workflow Description Language
TLDR
This chapter presents the Abstract Workflow Description Language (AWDL) for describing scientific workflow applications at a high level of abstraction, and describes a material science workflow that has been successfully ported to a distributed infrastructure based on the AWDL specification.
...
...

References

SHOWING 1-10 OF 12 REFERENCES
Provenance Collection Support in the Kepler Scientific Workflow System
TLDR
A complete framework for data and process provenance in the Kepler Scientific Workflow System is described and how generic provenance capture can be facilitated in Kepler's actor-oriented workflow environment is introduced.
Semantic Metadata Generation for Large Scientific Workflows
TLDR
Novel metadata reasoning capabilities that have been developed are used to support the creation of large executable workflows in an earthquake science application with more than 7,000 jobs, generating metadata for more than 100,000 new files.
Knowledge Discovery for Biology with Taverna
TLDR
The myGrid project has the potential to integrate and aggregate workflow outcomes, and reason over provenance logs to identify new experimental insights, and to build and export a Semantic Web of experiments that contributes to Knowledge Discovery for Taverna users and for the scientific community as a whole.
Automatic annotation of Web services based on workflow definitions
TLDR
It is shown that despite their imprecise nature these so-called loose annotations are still of value in supporting the manual annotation task, inspecting workflows and discovering services, and that derived annotations for already annotated parameters are useful.
Actor-Oriented Design of Scientific Workflows
TLDR
This work develops a formal model for scientific workflows based on an actor-oriented modeling and design approach, originally developed for studying models of complex concurrent systems, and extends this framework by introducing a novel hybrid type system.
The myGrid ontology: bioinformatics service discovery
TLDR
The (my)Grid ontology is one component in a larger semantic discovery framework for the identification of the highly distributed and heterogeneous bioinformatics services in the public domain and adopt a spectrum of expressivity and reasoning for different tasks in service annotation and discovery.
Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery
TLDR
This paper describes the requirements from the bioinformatics domain which demand technically simpler descriptions, involving the user community at all levels, and describes the data model and light-weight semantic discovery architecture.
Requirements and Services for Metadata Management
TLDR
This paper identifies general requirements for metadata management and describes a simple model and service that focuses on RDF metadata to address these requirements.
Metadata and the world wide web
TLDR
This article considers metadata in today’s Web environment, defines metadata, examines the relationship between metadata and cataloging, provides definitions for key metadata vocabulary terms, and explores the topic of metadata generation.
An approach to support Web service classification and annotation
TLDR
Results obtained classifying a set of Web services show that the automatically classify services to specific domains and identify key concepts inside service textual documentation, and builds a lattice of relationships between service annotations.
...
...