A Science Driven Production Cyberinfrastructure—the Open Science Grid

@article{Altunay2010ASD,
  title={A Science Driven Production Cyberinfrastructure—the Open Science Grid},
  author={Mine Altunay and Paul Avery and Kent Blackburn and Brian Paul Bockelman and Michael Ernst and Dan Fraser and Robert Quick and Robert W. Gardner and Sebastien Goasguen and Tanya Levshina and Miron Livny and John McGee and Doug Olson and Ruth Pordes and Maxim Potekhin and Abhishek Singh Rana and Alain J. Roy and Chander Sehgal and Igor Sfiligoi and Frank W{\"u}rthwein},
  journal={Journal of Grid Computing},
  year={2010},
  volume={9},
  pages={201--218}
}
This article describes the Open Science Grid, a large distributed computational infrastructure in the United States which supports many different high-throughput scientific applications, and partners (federates) with other infrastructures nationally and internationally to form multi-domain integrated distributed systems for science. The Open Science Grid consortium not only provides services and software to an increasingly diverse set of scientific communities, but also fosters a collaborative… 
Commissioning the HTCondor-CE for the Open Science Grid
TLDR
The focus of the HTCondor-CE is not job submission (as in GRAM or CREAM) but resource provisioning; deploying this gateway across the OSG required integrating it with the CE configuration, deploying a corresponding information service, coordinating with sites, and overhauling the documentation.
HPC on the Grid: The Theophys Experience
TLDR
This work analyzes the support for parallel jobs in scientific Grid middleware, then describes how the community unified the management of most of its computational needs on the Grid through a dedicated project that integrates serial and parallel resources in a common Grid-based framework.
Intercontinental Grids: An Infrastructure for Demand-Driven Innovation
TLDR
The paper focuses on the gap between e-Science and e-Social Science, arguing that the Grid topology is more compatible with the socio-legal demands of large-scale study of society than more dynamically distributed approaches, such as Cloud Computing.
Solving the Interoperability Problem by Means of a Bus
TLDR
The interoperability problem among a set of different computing infrastructures belonging to an institution, which needed to cooperate on a computation-intensive problem, was solved with a platform based on a central bus that the involved system components share for information exchange.
Validation of Grid Middleware for the European Grid Infrastructure
TLDR
The design, development and implementation of this software validation process are detailed, and several metrics are analyzed to evaluate the impact of the process on the stability of the production infrastructure, which it protects by capturing malfunctions and other issues in the initial testing phases.
Towards a Global Service Registry for the World-Wide LHC Computing Grid
TLDR
The intention is to allow the virtual organisation's configuration databases to be decoupled from the underlying information systems in a transparent way and hence simplify any possible future migration due to the evolution of those systems.
DGridSim: a multi-model discrete-event simulator for real-time data grid systems
TLDR
DGridSim is a multi-model discrete-event simulator that can model and simulate a variety of data grid system models by means of several system services and their interactions.
The Evolution of the Pegasus Workflow Management Software
TLDR
The evolution of Pegasus over time is described, the motivations behind the design decisions are provided, and selected lessons learned are presented.
Component-based approach for programming and running scientific applications on grids and clouds
TLDR
It is demonstrated how the proposed methodology can be implemented by combining the unique features of the Common Component Architecture (CCA) model together with the H2O resource sharing platform, resulting in the MOCCA component framework.

References

The Open Science Grid
TLDR
The Open Science Grid provides support for and evolution of the infrastructure through activities that cover operations, security, software, troubleshooting, addition of new capabilities, and support for existing and engagement with new communities.
The Open Science Grid
TLDR
The partners in the Open Science Grid Consortium have come together to build a sustainable national production Grid infrastructure in the United States that will be open to scientific collaborations.
Challenges Facing Production Grids
  • R. Pordes
  • Computer Science
    High Performance Computing Workshop
  • 2006
TLDR
The status and challenges of Grids operating in production today are examined, using the Open Science Grid as an example.
Analysis of the current use, benefit, and value of the Open Science Grid
TLDR
A first categorization of OSG value and an analysis across several different aspects of the Consortium's goals and activities are presented, along with some of the upcoming challenges of the LHC data analysis ramp-up and ongoing contributions to the World Wide LHC Computing Grid.
Interactive and Real-Time Applications on the EGEE Grid Infrastructure
TLDR
This chapter presents the capabilities offered within EGEE for supporting interactive applications and shows that interactivity requires support from multiple layers of a grid architecture, starting from the grid middleware itself and extending to the grid site configuration and the network infrastructure.
Building and testing a production quality grid software distribution for the Open Science Grid
TLDR
This paper describes how the Open Science Grid (OSG) software stack is used to provide a production quality infrastructure for grid sites and users across OSG to run their grid jobs.
Globus Toolkit Version 4: Software for Service-Oriented Systems
TLDR
The principal characteristics of the latest release, the Web services-based GT4, which provides significant improvements over previous releases in terms of robustness, performance, usability, documentation, standards compliance, and functionality are summarized.
Distributed computing in practice: the Condor experience
TLDR
The history and philosophy of the Condor project are provided, and how it has interacted with other projects and evolved along with the field of distributed computing is described.
glideinWMS - A generic pilot-based Workload Management System
TLDR
This paper contains the structural overview of glideinWMS as well as a detailed description of the current implementation and the current scalability limits.