— This paper presents ZHT, a zero-hop distributed hash table, which has been tuned for the requirements of high-end computing systems. ZHT aims to be a building block for future distributed systems, such as parallel and distributed file systems, distributed job management systems, and parallel programming systems. The goals of ZHT are delivering high… (More)
iii ACKNOWLEDGEMENT It gives me a great pleasure to thank all those people who have helped me and contributed for the successful completion of this work. First of all, I would like to thank my advisor Dr. Ioan Raicu for giving me the opportunity to work under him on this project. The course I took under him triggered my passion in the field of distributed… (More)
Exascale systems will bring significant challenges. This work attempts to addresses them through the Many-Task Computing (MTC) paradigm, by delivering data-aware job scheduling systems and fully asynchronous distributed architectures. MTC applications are structured as DAG graphs of tasks, with dependencies forming the edges. The asynchronous nature of MTC… (More)
Efficiently scheduling large number of jobs over large-scale distributed systems is critical in achieving high system utilization and throughput. Today's state-of-the-art job management systems have predominantly Master/Slaves architectures, which have inherent limitations, such as scalability issues at extreme scales (e.g. petascales and beyond) and single… (More)
and 53 ms of latency. Results show that up to 97% efficiency of such high bandwidth high latency network is possible, achieving 80-90 Gbps in most test cases with a peak transfer rate of 100 Gbps.
Exascale computers (with millions of nodes and billions of cores) will enable the unraveling of significant scientific mysteries around the year 2019. Many-task computing (MTC) is a new viable distributed paradigm for extreme-scale supercomputing. The MTC paradigm can address three of the four major challenges of exascale computing, namely Concurrency and… (More)