Longxiang Chen

Learn More
Goal: Improving communication performance of distributed matrix multiplication to achieve energy efficiency  Devise a high performance communication scheme o Fully exploiting network bandwidth of distributed matrix multiplication via non-blocking pipeline broadcast with tuned chunk size  Model and quantify the communication time complexity of binomial(More)
Soft errors are one-time events that corrupt the state of a computing system but not its overall functionality. Soft errors normally do not interrupt the execution of the affected program, but the affected computation results can not be trusted any more. A well known technique to correct soft errors in matrix-matrix multiplication is algorithm-based fault(More)
The demands of improving energy efficiency for high performance scientific applications arise crucially nowadays. Software-controlled hardware solutions directed by Dynamic Voltage and Frequency Scaling (DVFS) have shown their effectiveness extensively. Although DVFS is beneficial to green computing, introducing DVFS itself can incur non-negligible(More)
Keywords: Algorithm-based fault tolerance Matrix multiplication Fault tolerant linear algebra On-line algorithm based fault tolerance a b s t r a c t Soft errors are one-time events that corrupt the state of a computing system but not its overall func-tionality. Soft errors normally do not interrupt the execution of the affected program, but the affected(More)
Keywords: Power and energy Performance Power management Supercomputers Numerical linear algebra DVFS a b s t r a c t Extreme scale supercomputers available before the end of this decade are expected to have 100 million to 1 billion computing cores. The power and energy efficiency issue has become one of the primary concerns of extreme scale high performance(More)
  • 1