DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning

@article{Chen2014DianNaoAS,
  title={DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning},
  author={Tianshi Chen and Zidong Du and Ninghui Sun and J. Wang and Chengyong Wu and Yunji Chen and O. Temam},
  journal={Proceedings of the 19th international conference on Architectural support for programming languages and operating systems},
  year={2014}
}
  • Tianshi Chen, Zidong Du, +4 authors O. Temam
  • Published 2014
  • Computer Science
  • Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
Machine-Learning tasks are becoming pervasive in a broad range of domains, and in a broad range of systems (from embedded systems to data centers. [...] Key Result Such a high throughput in a small footprint can open up the usage of state-of-the-art machine-learning algorithms in a broad set of systems and for a broad set of applications.Expand
1,047 Citations
DianNao family
  • 50
Fast and Efficient Convolutional Accelerator for Edge Computing
  • 6
SCALEDEEP: A scalable compute architecture for learning and evaluating deep networks
  • 119
  • Highly Influenced
Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices
  • 179
  • PDF
DaDianNao: A Machine-Learning Supercomputer
  • Yunji Chen, Tao Luo, +8 authors O. Temam
  • Computer Science
  • 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture
  • 2014
  • 921
  • PDF
DaDianNao: A Neural Network Supercomputer
  • 78
  • Highly Influenced
SOLAR: Services-Oriented Deep Learning Architectures-Deep Learning as a Service
  • 3
Understanding the Impact of On-chip Communication on DNN Accelerator Performance
  • 3
  • PDF
Low-power accelerators for cognitive computing
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 13 REFERENCES
Improving the speed of neural networks on CPUs
  • 603
  • Highly Influential
  • PDF
A defect-tolerant accelerator for emerging high-performance applications
  • O. Temam
  • Computer Science
  • 2012 39th Annual International Symposium on Computer Architecture (ISCA)
  • 2012
  • 149
  • Highly Influential
  • PDF
Understanding sources of inefficiency in general-purpose chips
  • 424
  • Highly Influential
  • PDF
Convolution engine: balancing efficiency & flexibility in specialized computing
  • 153
  • Highly Influential
  • PDF
Building high-level features using large scale unsupervised learning
  • 1,973
  • Highly Influential
  • PDF
NeuFlow: A runtime reconfigurable dataflow processor for vision
  • 324
  • Highly Influential
  • PDF
A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Processor With Bio-Inspired Neural Perception Engine
  • 124
  • Highly Influential
  • PDF
An empirical evaluation of deep architectures on problems with many factors of variation
  • 863
  • Highly Influential
  • PDF
Traffic sign recognition with multi-scale Convolutional Networks
  • 586
  • Highly Influential
  • PDF
Convolutional neural networks applied to house numbers digit classification
  • 403
  • Highly Influential
  • PDF
...
1
2
...