Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 234,670,165 papers from all fields of science
Search
Sign In
Create Free Account
Bellman equation
Known as:
Bellman-Equation
, Bellman's optimality principle
, Policy function
Expand
A Bellman equation, named after its discoverer, Richard Bellman, also known as a dynamic programming equation, is a necessary condition for…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
35 relations
Algebraic Riccati equation
Artificial neural network
Automatic basis function construction
Backward induction
Expand
Broader (2)
Control theory
Dynamic programming
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2014
2014
Surprise and Curiosity for Big Data Robotics
Adam White
,
Joseph Modayil
,
R. Sutton
2014
Corpus ID: 14782363
This paper introduces a new perspective on curiosity and intrinsic motivation, viewed as the problem of generating behavior data…
Expand
Highly Cited
2011
Highly Cited
2011
Multigrid Methods for
Hamilton-Jacobi-Bellman
,
Hamilton-Jacobi Equations
,
D. Han
2011
Corpus ID: 2233819
We propose multigrid methods for solving Hamilton-Jacobi-Bellman (HJB) and HamiltonJacobi-Bellman-Isaacs (HJBI) equations. The…
Expand
2005
2005
On a method for mending time to failure distributions
Michael Grottke
,
Kishor S. Trivedi
Dependable Systems and Networks
2005
Corpus ID: 12004181
Many software reliability growth models assume that the time to next failure may be infinite; i.e., there is a chance that no…
Expand
2004
2004
A fast point-based algorithm for POMDPs
N. Vlassis
,
M. Spaan
2004
Corpus ID: 567199
We describe a point-based approximate value iteration algorithm for partially observable Markov decision processes. The algorithm…
Expand
Highly Cited
1996
Highly Cited
1996
Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot
P. Werbos
,
X. Pang
IEEE International Conference on Systems, Man and…
1996
Corpus ID: 60939529
Previous papers have explained why model-based adaptive critic designs-unlike other designs used in neurocontrol-have the…
Expand
1989
1989
Possibilistic Linear Programming with Measurable Multiattribute Value Functions
M. Inuiguchi
,
H. Ichihashi
,
Hideo Tanaka
INFORMS journal on computing
1989
Corpus ID: 912838
In this paper, a possibilistic linear program is formulated when a measurable multiattribute value function is given. The…
Expand
1986
1986
Everything You Always Wanted to Know about Linearization
D. Claude
1986
Corpus ID: 123429268
The state space approach, principally introduced by Bellman † (cf. [1]) and Kalman, deeply changed the outlook on automatic…
Expand
Highly Cited
1979
Highly Cited
1979
A Dynamic Programming Algorithm for Phase Estimation and Data Decoding on Random Phase Channels
O. Macchi
,
L. Scharf
1979
Corpus ID: 60746626
Abstract : The problem of simultaneously estimating phase and decoding data symbols from baseband data is posed. The phase…
Expand
Highly Cited
1970
Highly Cited
1970
Electrostatic Plasma Instabilities Excited by a High‐Frequency Electric Field
J. Sanmartín
1970
Corpus ID: 53130735
The electrostatic plasma waves excited by a uniform, alternating electric field of arbitrary intensity are studied on the basis…
Expand
1948
1948
On stability of free laminar boundary layer between parallel streams
M. Lessen
1948
Corpus ID: 55027817
An analysis and calculations on the stability of the free laminar boundary layer between parallel streams were made for an…
Expand