The Difficulty of Training Deep Architectures and the Effect of Unsupervised Pre-Training

@inproceedings{Erhan2009TheDO,
  title={The Difficulty of Training Deep Architectures and the Effect of Unsupervised Pre-Training},
  author={Dumitru Erhan and Pierre-Antoine Manzagol and Yoshua Bengio and Samy Bengio and Pascal Vincent},
  booktitle={AISTATS},
  year={2009}
}
Whereas theoretical work suggests that deep architectures might be more efficient at representing highly-varying functions, training deep architectures was unsuccessful until the recent advent of algorithms based on unsupervised pretraining. Even though these new algorithms have enabled training deep models, many questions remain as to the nature of this difficult learning problem. Answering these questions is important if learning in deep architectures is to be further improved. We attempt to… CONTINUE READING
Highly Influential
This paper has highly influenced 12 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 219 citations. REVIEW CITATIONS
134 Citations
16 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 134 extracted citations

220 Citations

0204060'10'12'14'16'18
Citations per Year
Semantic Scholar estimates that this publication has 220 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…