Corpus ID: 220614668

Improving the Programmability of GPU Architectures

@inproceedings{Nugteren2014ImprovingTP,
  title={Improving the Programmability of GPU Architectures},
  author={C. Nugteren},
  year={2014}
}
  • C. Nugteren
  • Published 2014
  • Computer Science
  • • A submitted manuscript is the author's version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version… CONTINUE READING
    5 Citations
    RT-CUDA: A Software Tool for CUDA Code Restructuring
    • 3
    • PDF
    Execution of Dataflow Process Networks on OpenCL Platforms
    • Wictor Lund, Sudeep Kanur, +4 authors U. Falk
    • Computer Science
    • 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
    • 2015
    • 14
    • PDF
    Power modeling and architectural techniques for energy-efficient GPUs

    References

    SHOWING 1-10 OF 148 REFERENCES
    Polyhedral parallel code generation for CUDA
    • 262
    • Highly Influential
    • PDF
    Ompss: a Proposal for Programming Heterogeneous Multi-Core Architectures
    • 529
    • PDF
    GPUs and the Future of Parallel Computing
    • 496
    • PDF
    hiCUDA: High-Level GPGPU Programming
    • 202
    Generating GPU Code from a High-Level Representation for Image Processing Kernels
    • 13
    • PDF
    A large-scale cross-architecture evaluation of thread-coarsening
    • A. Magni, C. Dubach, M. O’Boyle
    • Computer Science
    • 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
    • 2013
    • 68
    • PDF
    Auto-tuning a high-level language targeted to GPU codes
    • 300
    • PDF
    Cache Miss Analysis for GPU Programs Based on Stack Distance Profile
    • 36
    Performance Estimation of GPUs with Cache
    • 21