Learn More
Checkpointing of parallel applications can be used as the core technology to provide process migration. Both, check-pointing and migration, are an important issue for parallel applications on networks of workstations. The CoCheck environment which we present in this paper introduces a new approach to provide checkpointing and migration for parallel(More)
Checkpoints cannot only be used to increase fault tolerance, but also to migrate processes. The migration is particularly useful in workstation environments where machines become dynamically available and unavailable. We introduce the CoCheck environment which not only allows the creation of checkpoints, but also provides process migration. The creation of(More)
Writing parallel programs eeciently is still a problematic issue due to the lack of a uniform and integrated tool support. A new project at LRR-TUM works towards changing this situation for the most popular programming environment on workstation clusters: for PVM. The Tool-set for PVM will comprise a set of integrated tools, consisting of a debugger, a(More)
Running parallel applications in a network of workstations (NOW) requires the use of a resource management system with batch queueing and load balancing functionalities to utilize idle workstations in the NOW and to avoid load imbalance in the network. A resource management system for parallel jobs requires special func-tionalities to schedule jobs to hosts(More)
Workstations provide an attractive platform to develop software. This paper describes the NXLib environment which enables users to use this platform to develop parallel applications. The source code compatibility of NXLib applications with Intel's Paragon multicomputer system easily allows to migrate these applications to a supercomputer. The performance(More)
Much computational power on state-of-the-art multicomput-ers like the Paragon is wasted with porting applications. Using networks of workstations is an attempt to withdraw this workload from multicom-puter systems. Therefore an environment is needed which provides the programming interface of multicomputers on coupled workstations. The paper describes the(More)
Today's requirements for computational power are still not satisfied. Supercompu-ters on the one hand achieve good performance figures for a great variety of applications but are expensive to buy and maintain. Multiprocessors like the Paragon XP/S are cheaper but require more effort to port applications. As one consequence, much computational power of such(More)
A resource management system can eeectively shorten the runtime of batch jobs in a network of workstations (NOW). This is achieved with load balancing mechanisms to distribute the load equally among the hosts. To avoid connicts between interactive users and batch jobs, a resource management system must be able to migrate batch jobs from an interactive host(More)