• Publications
  • Influence
BIRCH: an efficient data clustering method for very large databases
TLDR
A data clustering method named BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies) is presented, and it is demonstrated that it is especially suitable for very large databases. Expand
Condor-a hunter of idle workstations
TLDR
The design, implementation, and performance of the Condor scheduling system, which operates in a workstation environment, are presented and a performance profile of the system is presented that is based on data accumulated from 23 stations during one month. Expand
Distributed computing in practice: the Condor experience
TLDR
The history and philosophy of the Condor project is provided and how it has interacted with other projects and evolved along with the field of distributed computing is described. Expand
Condor-G: A Computation Management Agent for Multi-Institutional Grids
TLDR
It is asserted that Condor-G can serve as a general-purpose interface to Grid resources, for use by both end users and higher-level program development tools. Expand
Matchmaking: distributed resource management for high throughput computing
TLDR
The classified advertisement (classad) matchmaking framework is developed and implemented, a flexible and general approach to resource management in distributed environment with decentralized ownership of resources. Expand
BIRCH: A New Data Clustering Algorithm and Its Applications
TLDR
An efficient and scalable data clustering method is proposed, based on a new in-memory data structure called CF-tree, which serves as an in- memory summary of the data distribution, and implemented in a system called BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies), and compared with other available methods. Expand
Pegasus, a workflow management system for science automation
TLDR
An integrated view of the Pegasus system is provided, showing its capabilities that have been developed over time in response to application needs and to the evolution of the scientific computing platforms. Expand
Pegasus: Mapping Scientific Workflows onto the Grid
TLDR
The Pegasus system that can map complex workflows onto the Grid and takes an abstract description of a workflow and finds the appropriate data and Grid resources to execute the workflow is described. Expand
Condor and the Grid
TLDR
Ready access to large amounts of computing power could be achieved inexpensively with collections of small devices rather than expensive single supercomputers in the 1970s. Expand
Examining the Challenges of Scientific Workflows
TLDR
A recent National Science Foundation workshop brought together domain, computer, and social scientists to discuss requirements of future scientific applications and the challenges they present to current workflow technologies. Expand
...
1
2
3
4
5
...