Learning to Forget: Continual Prediction with LSTM

  title={Learning to Forget: Continual Prediction with LSTM},
  author={Felix A. Gers and J{\"u}rgen Schmidhuber and Fred A. Cummins},
  journal={Neural Computation},
Long short-term memory (LSTM; Hochreiter & Schmidhuber, 1997) can solve numerous tasks not solvable by previous learning algorithms for recurrent neural networks (RNNs). We identify a weakness of LSTM networks processing continual input streams that are not a priori segmented into subsequences with explicitly marked ends at which the network's internal state could be reset. Without resets, the state may grow indefinitely and eventually cause the network to break down. Our remedy is a novel… CONTINUE READING
Highly Influential
This paper has highly influenced 100 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 1,297 citations. REVIEW CITATIONS
783 Citations
31 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 783 extracted citations

1,297 Citations

Citations per Year
Semantic Scholar estimates that this publication has 1,297 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…