Hitoshi Murai

Learn More
We succeeded in getting 14.9 TFLOPS performance when running a plasma simulation code IMPACT-3D parallelized with High Performance Fortran on 512 nodes of the Earth Simulator. The theoretical peak performance of the 512 nodes is 32 TFLOPS, which means 45% of the peak performance was obtained with HPF. IMPACT-3D is an implosion analysis code using TVD(More)
Detail of control technique of bit-wise phase correlation in 160 (4 x 40) Gbit/s optical time division multiplexing (OTDM) signal using a phase-correlation monitor based on 1-bit delay asymmetric interferometers (AIFs) is described. The 1-bit delay AIF transforms a bit-by-bit optical phase discontinuity to an optical power variation, so that it enables to(More)
We are developing HPF/SX V2, an HPF compiler for vector parallel machines. It provides some unique extensions as well as the features of HPF 2.0 and HPF/JA. This paper describes in particular four of them: 1) the ON directive of HPF 2.0, 2) the REFLECT and LOCAL directives of HPF/JA, 3) vectorization directives, and 4) automatic parallelization. We(More)
The present paper introduces the XcalableACC (XACC) programming model, which is a hybrid model of the XcalableMP (XMP) Partitioned Global Address Space (PGAS) language and OpenACC. XACC defines directives that enable programmers to mix XMP and OpenACC directives in order to develop applications that can use accelerator clusters with ease. Moreover, in order(More)
This paper describes new fast integer sorting methods for single vector and shared-memory parallel vector computers, based on the bucket sort algorithm. Existing vectorization methods for bucket sort have made great efforts to avoid store conflicts of vector scatter operations, and therefore are not so efftcient. The vectorization methods shown in this(More)
Given that scientific computer programs are becoming larger and more complicated, high performance application developers routinely examine the program structure of their source code to improve their performance. We have developed K-scope, a source code analysis tool that can be used to improve code performance. K-scope has graphical user interface that(More)
Partitioned Global Address Space (PGAS) programming languages have emerged as a means by which to program parallel computers, which are becoming larger and more complicated. For such languages, regular stencil codes are still one of the most important goals. We implemented three methods of stencil communication in a compiler for a PGAS language XcalableMP,(More)
In this paper, we reveal that some kinds of optical nonlinearities are further enhanced when incoherent light, instead of a laser, is used as a pump light. This idea was confirmed both theoretically and experimentally in the case of sum-frequency generation (SFG) using the optical second nonlinearity. The conversion efficiency of the SFG with incoherent(More)
In order to monitor quality of ultra high bit-rate optical signals in a future optical network, such as 160 Gb/s, a simple monitoring technique is required. Therefore, a novel waveform monitoring technique by prescaled-clock tone detection was proposed in a previous report. In this paper, detailed principle of the proposed technique was explained. The(More)