Tieqiang Mo

When optimizing GPU performance, control-flow divergence among the threads of a warp can become a performance bottleneck. In our hand-coded GPU stencil computation optimization, to remove the control-flow divergence introduced by the conventional mapping between global memory and shared memory, we devise a new mapping mechanism by …