SnaPEA : Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks

Abstract

Deep Convolutional Neural Networks (CNNs) perform billions of operations for classifying a single input. To reduce these computations, this paper offers a solution that leverages a combination of runtime information and the algorithmic structure of CNNs. Specifically, in numerous modern CNNs, the outputs of compute-heavy convolution operations are fed to… (More)

16 Figures and Tables