A Deep Neural Network Compression Pipeline: Pruning, Quantization, Huffman Encoding
@inproceedings{Han2015ADN, title={A Deep Neural Network Compression Pipeline: Pruning, Quantization, Huffman Encoding}, author={Song Han and Huizi Mao and W. J. Dally}, year={2015} }
Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources. [...] Key Method Our method first prunes the network by learning only the important connections. Next, we quantize the weights to enforce weight sharing, finally, we apply Huffman encoding. After the first two steps we retrain the network to fine tune the remaining connections and the quantized centroids.Expand Abstract
Supplemental Video
Figures, Tables, and Topics from this paper
49 Citations
Deep Neural Network Compression Method Based on Product Quantization
- Computer Science
- 2020 39th Chinese Control Conference (CCC)
- 2020
Scalpel: Customizing DNN pruning to the underlying hardware parallelism
- Computer Science
- 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)
- 2017
- 215
- Highly Influenced
- PDF
Structured Pruning of Deep Convolutional Neural Networks
- Computer Science, Mathematics
- ACM J. Emerg. Technol. Comput. Syst.
- 2017
- 337
- Highly Influenced
- PDF
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
- Computer Science
- ICLR
- 2016
- 523
- Highly Influenced
- PDF
Pruning Deep Convolutional Neural Networks for Fast Inference
- Computer Science
- 2017
- Highly Influenced
- PDF
References
SHOWING 1-10 OF 24 REFERENCES
ImageNet classification with deep convolutional neural networks
- Computer Science
- Commun. ACM
- 2012
- 59,744
- PDF
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation
- Computer Science
- NIPS
- 2014
- 1,021
- PDF
Going deeper with convolutions
- Computer Science
- 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2015
- 22,376
- PDF
Deep Fried Convnets
- Computer Science, Mathematics
- 2015 IEEE International Conference on Computer Vision (ICCV)
- 2015
- 209
- PDF