## Figures and Tables from this paper

## 19,408 Citations

Gradient Flow in Recurrent Nets: The Difficulty of Learning LongTerm Dependencies

- Biology
- 2001

This chapter contains sections titled: Introduction Exponential Error Decay Dilemma: Avoiding Aradient Decay Prevents Long-Term Latching Remedies Conclusion
]]>

Stochastic Learning

- Computer ScienceAdvanced Lectures on Machine Learning
- 2003

This contribution presents an overview of the theoretical and practical aspects of the broad family of learning algorithms based on Stochastic Gradient Descent, including Perceptrons, Adalines,…

Backpropagation in matrix notation

- MathematicsArXiv
- 2017

In this note, the gradient of the network function is calculated in matrix notation to solve the inequality of the following type: For α ≥ 1, β ≥ 1 using LaSalle's inequality.

Neural Networks and Complexity Theory

- MathematicsNord. J. Comput.
- 1992

Some of the central results in the complexity theory of neural networks, with pointers to the literature, are surveyed.

Soft-Competitive Learning Paradigms

- Computer Science
- 2000

Learning is the ability to autonomously select, update, and store relevant information in memory; and the ability to predict and create based on what has been learned.

Mathematical Programming in Machine Learning

- Computer Science
- 1996

A number of central problems of machine learning are described and shown how they can be modeled and solved as mathematical programs of various complexity.

Mathematical Programming in Machine Learning

- Computer Science
- 1996

A number of central problems of machine learning are described and shown how they can be modeled and solved as mathematical programs of various complexity.

Nurture, Nature, Structure a Computational Approach to Learning

Master Thesis on Cognitive Science and Artiicial Intelligence written under the supervision of dr. i Soo een roerlijck schepsel uyt onroerlijcke stooe voortquam, dat soude een mirakel zyn 1

## References

SHOWING 1-10 OF 66 REFERENCES

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations

- Computer Science
- 1986

The fundamental principles, basic mechanisms, and formal analyses involved in the development of parallel distributed processing (PDP) systems are presented in individual chapters contributed by…

Harmony Theory: Problem Solving, Parallel Cognitive Models, and Thermal Physics.

- Physics
- 1984

Abstract : The first paper describes a parallel model designed to solve a class of relatively simple problems from elementary physics, and discusses the implications for models of problem solving in…

Harmony Theory: A Mathematical Framework for Stochastic Parallel Processing.

- Computer Science
- 1983

As this temperature is lowered, the system appears to display a dramatic tendency to coherently interpret input, even if the evidence for any particular interpretation is very weak.

Inductive Information Retrieval Using Parallel Distributed Computation.

- Computer Science
- 1984

The retrieval system described makes dynamic use of the internal structure of a database to infer relationships among items in the database, which can help overcome incompleteness and imprecision in requests for information, as well as in thedatabase itself.

Pattern-recognizing stochastic learning automata

- Computer ScienceIEEE Transactions on Systems, Man, and Cybernetics
- 1985

A class of learning tasks is described that combines aspects of learning automation tasks and supervised learning pattern-classification tasks. These tasks are called associative reinforcement…

A Learning Algorithm for Boltzmann Machines

- Computer ScienceCogn. Sci.
- 1985

A general parallel search method is described, based on statistical mechanics, and it is shown how it leads to a general learning rule for modifying the connection strengths so as to incorporate knowledge about a task domain in an efficient way.

Analogical Processes in Learning

- Computer Science
- 1980

The role of analogy and procedural representation in learning is examined from several domains, including turtle geometry, kinship terms, and the learning of a computer text editor.

Learning by statistical cooperation of self-interested neuron-like computing elements.

- Computer ScienceHuman neurobiology
- 1985

It is argued that some of the longstanding problems concerning adaptation and learning by networks might be solvable by this form of cooperativity, and computer simulation experiments are described that show how networks of self-interested components that are sufficiently robust can solve rather difficult learning problems.

Separating Figure from Ground with a Parallel Network

- Computer SciencePerception
- 1986

The network model is too simplified to serve as a model of human performance, but it does demonstrate that one global property of outlines can be computed through local interactions in a parallel network.