Georg Stellner

Checkpointing of parallel applications can be used as the core technology to provide process migration. Both checkpointing and migration are important issues for parallel applications on networks of workstations. The CoCheck environment, which we present in this paper, introduces a new approach to provide checkpointing and migration for parallel…
Checkpoints can be used not only to increase fault tolerance, but also to migrate processes. Migration is particularly useful in workstation environments where machines become dynamically available and unavailable. We introduce the CoCheck environment, which not only allows the creation of checkpoints, but also provides process migration. The creation of…
Currently PVM is the standard for developing parallel applications in workstation environments. One of its goals is to use the computational power of idling workstations. In practice many users refrain from opening their machine to other users’ PVM processes. This is due to their experience that such a process, which usually requires a lot of resources (CPU…
Much computational power on state-of-the-art multicomputers like the Paragon is wasted on porting applications. Using networks of workstations is an attempt to withdraw this workload from multicomputer systems. Therefore, an environment is needed that provides the programming interface of multicomputers on coupled workstations. The paper describes the…
Writing parallel programs efficiently is still a problematic issue due to the lack of uniform and integrated tool support. A new project at LRR-TUM works towards changing this situation for the most popular programming environment on workstation clusters: PVM. The Tool-set for PVM will comprise a set of integrated tools, consisting of a debugger, a…
Running parallel applications in a network of workstations (NOW) requires the use of a resource management system with batch queueing and load balancing functionalities to utilize idle workstations in the NOW and to avoid load imbalance in the network. A resource management system for parallel jobs requires special functionalities to schedule jobs to hosts…
Workstations provide an attractive platform for developing software. This paper describes the NXLib environment, which enables users to use this platform to develop parallel applications. The source code compatibility of NXLib applications with Intel's Paragon multicomputer system makes it easy to migrate these applications to a supercomputer. The performance…
Today’s requirements for computational power are still not satisfied. Supercomputers on the one hand achieve good performance figures for a great variety of applications but are expensive to buy and maintain. Multiprocessors like the Paragon XP/S are cheaper but require more effort to port applications. As one consequence, much computational power of such…