A view of the parallel computing landscape

@article{Asanovi2009AVO,
  title={A view of the parallel computing landscape},
  author={Krste Asanovi{\'c} and Rastislav Bod{\'i}k and James Demmel and Tony M. Keaveny and Kurt Keutzer and John D. Kubiatowicz and Nelson Morgan and David A. Patterson and Koushik Sen and John Wawrzynek and David Wessel and Katherine A. Yelick},
  journal={Commun. ACM},
  year={2009},
  volume={52},
  pages={56--67}
}
Writing programs that scale with increasing numbers of cores should be as easy as writing programs for sequential computers. 

Benchmarking Parallelism and Concurrency in the Encore Programming Language

Due to the limit in speedup of clock speed, most modern computers now sport multicore chips to provide computing power. Currently, programming language support for such machines often consists of e

Skeleton Programming for Heterogeneous GPU-based Systems

In this thesis, we address issues associated with programming modern heterogeneous systems while focusing on a special kind of heterogeneous systems that include multicore CPUs and one or more GPUs

Making parallel programs reliable with stable multithreading

Stable multithreading dramatically simplifies the interleaving behaviors of parallel programs, offering new hope for making parallel programming easier.

Actors and higher order functions: A Comparative Study of Parallel Programming Language Support for Bioinformatics

TLDR
Functional programming can greatly reduce the complexity of parallelization and helps avoid issues such as race conditions and synchronization errors.

Abstractions for performance programming on multi-core architectures with hierarchical memory

TLDR
Abstractions for performance programming on multi-core architectures with hierarchical memory are presented in this doctoral dissertation.

Analysis of High Performance Applications Using Workload Requirements

TLDR
Two novel methodologies for analyzing scientific applications in distributed environments, using workload requirements, are proposed, and the impact of features such as problem size and programming language across different computational architectures is explored.

Interrupt driven parallel processing

TLDR
The paper presents modifications to multiprocessor architectures that aim to increase computation speed through parallel memory access by skipping the usual steps of the interrupt service routine.

Review on parallel computing become ubiquitous

TLDR
Future microprocessors will have hundreds of cores; work on applications, programming environments, and architectures that will meet this challenge is reviewed, and how to achieve ubiquitous parallel computing is presented.

Sparse-Matrix support for the SkePU library for portable CPU/GPU programming

In this thesis work we have extended the SkePU framework by designing a new container data structure for the representation of generic two dimensional sparse matrices. Computation on matrices is an

Implementing Safe Sharing Features for Encore

Actor isolation is an important property in parallel and concurrent programs that utilize the actor model. However, when expressing certain patterns isolation can sometimes be too strong and forces

References

Amdahl's Law in the Multicore Era

  • M. Hill
  • Computer Science
    Computer
  • 2008
Augmenting Amdahl's law with a corollary for multicore hardware makes it relevant to future generations of chips with multiple processor cores. Obtaining optimal multicore performance will require

Patterns for parallel programming

TLDR
This book is the first parallel programming guide written specifically to serve working software developers, not just computer scientists, and introduces a complete, highly accessible pattern language that will help any experienced developer "think parallel" and start writing effective parallel code almost immediately.

Roofline: an insightful visual performance model for multicore architectures

TLDR
The Roofline model offers insight on how to improve the performance of software and hardware in the rapidly changing world of connected devices.

Optimization of sparse matrix-vector multiplication on emerging multicore platforms

TLDR
This work examines sparse matrix-vector multiply (SpMV) - one of the most heavily used kernels in scientific computing - across a broad spectrum of multicore designs, and presents several optimization strategies especially effective for the multicore environment.

Language Extensions in Support of Compiler Parallelization

TLDR
This paper proposes an approach to automatic compiler parallelization based on language extensions that is applicable to a broader range of program structures and application domains than in past work and motivates future work on building an automatically parallelizing compiler.

The Parallel Computing Laboratory at U.C. Berkeley: A Research Agenda Based on the Berkeley View

TLDR
This report is based on a proposal for creating a Universal Parallel Computing Research Center (UPCRC) that a technical committee from Intel and Microsoft unanimously selected as the top proposal in a competition with the top 25 computer science departments.

An Introduction to Software Architecture

  • D. Garlan, M. Shaw
  • Computer Science
    Advances in Software Engineering and Knowledge Engineering
  • 1993
TLDR
This paper provides an introduction to the emerging field of software architecture by considering a number of common architectural styles upon which many systems are currently based and showing how different styles can be combined in a single design.

Model Checking Multithreaded Programs with Asynchronous Atomic Methods

TLDR
This paper presents a programming language, Spl, which extends a simplified form of sequential Java with asynchronous method invocations, in addition to the standard synchronous method calls, and with the ability to execute asynchronous methods in threads atomically and concurrently.

Self-Adapting Linear Algebra Algorithms and Software

TLDR
The generation of dense and sparse Basic Linear Algebra Subprograms (BLAS) kernels and the selection of linear solver algorithms are described.

[Book Review] "Computer Organization and Design, The Hardware/Software Interface"
