When a floating-point arithmetic is executed on a processor unit, round-off and truncation errors occur every calculation. These errors cause a precision issue in a large simulation which requires a great number of calculations. Therefore, we have developed the quadruple-precision basic linear algebra subprograms (QPBLAS) based on Bailey's double-double… (More)

@article{Yamada2017QuadruplePrecisionBU,
title={Quadruple-Precision BLAS Using Bailey's Arithmetic with FMA Instruction: Its Performance and Applications},
author={Susumu Yamada and Toshiyuki Imamura and Takuya Ina and Narimasa Sasa and Yasuhiro Idomura and Masahiko Machida},
journal={2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)},
year={2017},
pages={1418-1425}
}