Skip to search formSkip to main contentSkip to account menu

Thread block

A thread block is a programming abstraction that represents a group of threads that can be executing serially or in parallel. For better process and… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2020
2020
We present a systematic methodology for optimizing batched matrix multiplications on SW26010 many-core processor of the Sunway… 
2015
2015
We present a brute-force approach for finding k-nearest neighbors on the GPU for many queries in parallel. Our program takes… 
2015
2015
Energy efficiency is undoubtedly important for GPU architectures. Besides the traditionally explored energy-efficiency… 
Highly Cited
2015
Highly Cited
2015
GPUs have been proven effective for structured applications that map well to the rigid 1D-3D grid of threads in modern bulk… 
2013
2013
Non-Local Means (NLM) algorithm is widely considered as a state-of-the-art denoising filter in many research fields. High… 
2011
2011
In recent years hardware accelerators have become a full part of the HPC domain as their peak performance has increased steadily… 
Highly Cited
2011
Highly Cited
2011
This work discusses the development of a three-dimensional, high-order, compressible viscous ow solver for mixed unstructured… 
Highly Cited
2008
Highly Cited
2008
We present a technique for designing memory-bound algorithms with high data reuse on Graphics Processing Units (GPUs) equipped…