Bones: An Automatic Skeleton-Based C-to-CUDA Compiler for GPUs

  title={Bones: An Automatic Skeleton-Based C-to-CUDA Compiler for GPUs},
  author={C. Nugteren and H. Corporaal},
  journal={ACM Trans. Archit. Code Optim.},
  • C. Nugteren, H. Corporaal
  • Published 2014
  • Computer Science
  • ACM Trans. Archit. Code Optim.
  • The shift toward parallel processor architectures has made programming and code generation increasingly challenging. To address this programmability challenge, this article presents a technique to fully automatically generate efficient and readable code for parallel processors (with a focus on GPUs). This is made possible by combining algorithmic skeletons, traditional compilation, and “algorithmic species,” a classification of program code. Compilation starts by automatically annotating C code… CONTINUE READING
    29 Citations
    Generation of high-performance code based on a domain-specific language for algorithmic skeletons
    • 2
    SPINE: From C loop-nests to highly efficient accelerators using Algorithmic Species
    • 2
    Extending OpenACC for Efficient Stencil Code Generation and Execution by Skeleton Frameworks
    • 4
    • PDF
    Automatic CPU/GPU Generation of Multi-versioned OpenCL Kernels for C++ Scientific Applications
    • 7
    (AS)2: Accelerator synthesis using algorithmic skeletons for rapid design space exploration
    • 10
    • PDF
    Matrix multiplication beyond auto-tuning: Rewrite-based GPU code generation
    • 19
    • PDF


    Polyhedral parallel code generation for CUDA
    • 262
    • Highly Influential
    • PDF
    A practical automatic polyhedral parallelizer and locality optimizer
    • 767
    • Highly Influential
    • PDF
    On the Complexity of Loop Fusion
    • A. Darte
    • Computer Science
    • Parallel Comput.
    • 2000
    • 130
    • Highly Influential
    PolyBench: The Polyhedral Benchmark Suite On-line
    • PolyBench: The Polyhedral Benchmark Suite On-line
    • 2013