Supporting scientific discovery processes in Discovery Net

  • Jameel Syed, Moustafa M. Ghanem, Yike Guo
  • Published 1 February 2007
  • Computer Science
  • Concurrency and Computation: Practice and Experience
The activity of e‐Science involves making discoveries by analysing data to find new knowledge. Discoveries of value cannot be made by simply performing a pre‐defined set of steps to produce a result. Rather, there is an original, creative aspect to the activity that by its nature cannot be automated. In addition to finding new knowledge, discovery therefore also concerns finding a process to find new knowledge. How discovery processes are modelled is therefore key to effectively practicing e… 

The design and implementation of a workflow analysis tool

  • V. Curcin, M. Ghanem, Yike Guo
  • Computer Science
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
  • 2010
This article introduces a framework and a set of associated methods for analysing the execution properties of scientific workflows, and uses the framework to design the architecture of a customizable tool that can be used to analyse the key execution properties at the authoring stage.

Polymorphic type framework for scientific workflows with relational data model

The focus is on the relational data model, popular in data analysis workflow systems, and the techniques introduced are validated by applying the inference engine prototype to an adverse drug reaction study implemented in the relational algebra subset of the Discovery Net workflow system.

Open workflow infrastructure: a research agenda

A research agenda to build an infrastructure with a flexible design to work on an Internet-wide scale to incorporate workflow editing, sharing and enactment capabilities directly into the Internet, thus making distributed applications available and usable in a wide range of pervasive settings.

Service-Oriented Software in the Humanities: A Software Engineering Perspective

  • N. Gold
  • Computer Science
  • Digit. Humanit. Q.
  • 2009
This paper examines, from the perspective of a software engineer relatively new to the digital humanities, how the recent developments in service-oriented architectures could be used to enable new approaches to digital enquiry in the arts and humanities.

Big Data in High Performance Scientific Computing

High performance scientific computing is located at the intersection of a number of scientific disciplines and skill sets, in the sense that it requires at least basic knowledge and skills in these scientific fields.

Cloud Computing for e-Sciences at Université Sorbonne Paris Cité

The outcome of the paper is a methodology for accompanying adequate technological choices and acceptance by the users of the cultural changes when they migrate to cloud technologies.

Data Integration in the Life Sciences: Fun, Findings and Frustrations

This paper presents no technical results, but rather provides a classification of research activities in terms of the contributions they seek to make to the life sciences, bioinformatics or computer science.

Sensor Grid Enhancement with Data Management System for Ubiquitous Healthcare Computing

This chapter proposes the SEnsor Grid Enhancement Data Management system (SEGEDMA), which ensures the integration of different network technologies, continuous data access for system users, and the interoperability of Open Geospatial Consortium and HL7 standards.

The National Weather Sensor Grid: a large-scale cyber-sensor infrastructure for environmental monitoring

The Scalable Proxy-based aRchItecture for seNsor Grid (SPRING) is a sensor grid architecture framework on which the National Weather Sensor Grid (NWSG), a large-scale cyber-sensor infrastructure for environmental monitoring, is designed.

Ubiquitous Healthcare Computing with Sensor Grid Enhancement with Data Management System (SEGEDMA)

The SEnsor Grid Enhancement Data Management system (SEGEDMA) is proposed to ensure the integration of different network technologies, continuous data access for system users, and the interoperability of Open Geospatial Consortium (OGC) and HL7 standards.



Discovery net: towards a grid of knowledge discovery

This paper shows how this architecture will behave during a typical KDD process design and deployment, how it enables the execution of complex and distributed data mining tasks with high performance and how it provides a community of e-scientists with means to collaborate, retrieve and reuse both KDD algorithms, discovery processes and knowledge in a visual analytical environment.

The Design of Discovery Net: Towards Open Grid Services for Knowledge Discovery

The architecture is built on top of standard protocols and standard infrastructures but also defines its own protocols such as the Discovery Process Mark-up Language for data flow management and evaluates it by building a real-time genome annotation environment on top.

Discovery Processes: Representation And Re-Use

This work states that where successful processes need to be automated, the traditional bioinformatics approach has been to create bespoke applications using scripting languages such as Perl to define service composition for execution.

The myGrid Project: Services, Architecture and Demonstrator

The ultimate goal of Grid is to supply this collection of services as a toolkit to build end applications, and the project is building its own application (the Grid workBench).

Provenance of e-Science Experiments - Experience from Bioinformatics

An overview of initial work on the provenance of bioinformatics e-Science experiments within myGrid, which uses two kinds of provenance (the derivation path of information, and annotation) and explores how the resulting Webs of experimental data holdings can be mined for useful information and presentations for the e-Scientist.

Business process execution language for web services

This book focuses on executable processes and comes back to abstract processes in Chapter 4, which can be used to replace sets of rules usually expressed in natural language, which is often ambiguous.

Knowledge Discovery and Data Mining: Towards a Unifying Framework

The KDD process and basic data mining algorithms are defined, links between data mining, knowledge discovery, and other related fields are described, and challenges facing practitioners in the field are analyzed.

ICENI: An Integrated Grid Middleware to Support E-Science

This chapter describes ICENI, an integrated Grid middleware that explores the services and meta-data necessary to support e-research within a variety of application domains.

Distributed computing with Triana on the Grid

In this paper, we describe Triana, a distributed problem‐solving environment that makes use of the Grid to enable a user to compose applications from a set of components, select resources on which