Practical recommendations for gradient-based training of deep architectures

  title={Practical recommendations for gradient-based training of deep architectures},
  author={Yoshua Bengio},
  booktitle={Neural Networks: Tricks of the Trade},
Learning algorithms related to artificial neural networks and in particular for Deep Learning may seem to involve many bells and whistles, called hyperparameters. This chapter is meant as a practical guide with recommendations for some of the most commonly used hyper-parameters, in particular in the context of learning algorithms based on backpropagated gradient and gradient-based optimization. It also discusses how to deal with the fact that more interesting results can be obtained when… CONTINUE READING



Citations per Year

843 Citations

Semantic Scholar estimates that this publication has 843 citations based on the available data.

See our FAQ for additional information.

  • GitHub repos referencing this paper

  • Presentations referencing similar topics