Craig Stunkel

Collective communication performance is critical for many applications. In this paper, we present an architecture to efficiently support collective operations (such as multicasts and reductions) in the switches of parallel computer interconnects. We present an output-queuing switch architecture with cross-point buffering. Output-queuing architectures have ...
This paper presents Unified Communication X (UCX), a set of network APIs and their implementations for high-throughput computing. UCX comes from the combined efforts of national laboratories, industry, and academia to design and implement a high-performing and highly scalable network stack for next-generation applications and systems. UCX design provides ...