Reinforcement learning
Known as: RL, Actor critic architecture, Reward function
Reinforcement learning is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in…
Wikipedia
Related topics
50 relations
AIXI
Action selection
Andrew Barto
Anticipation (artificial intelligence)
Broader (1)
Belief revision
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2018
Attention Solves Your TSP
W. Kool, M. Welling
arXiv.org
Corpus ID: 88524343
We propose a framework for solving combinatorial optimization problems of which the output can be represented as a sequence of…
2009
Covariant Policy Search
W. Liu, Sanjiang Li, Jochen Renz
International Joint Conference on Artificial…
Corpus ID: 117131270
Increasing the expressiveness of qualitative spatial calculi is an essential step towards meeting the requirements of…
Highly Cited
2008
Publication IV
P. Alku
Corpus ID: 51740295
A large sample of vowels produced by male and female speakers were inverse filtered and parameterized using 21 different glottal…
Highly Cited
2008
A Distributed and Autonomic Virtual Network Mapping Framework
I. Houidi, Wajdi Louati, D. Zeghlache
International Conference on Autonomic and…
Corpus ID: 7411565
This paper addresses the challenge of assigning virtual networks, through network virtualisation, to the underlying physical…
2001
Modeling user interest shift using a bayesian approach
Wai Lam, Javed Mostafa
J. Assoc. Inf. Sci. Technol.
Corpus ID: 17371992
We investigate the modeling of changes in user interest in information filtering systems. A new technique for tracking user…
Highly Cited
1998
Maximizing sets and fuzzy Markoff algorithms
L. Zadeh
IEEE Trans. Syst. Man Cybern. Part C
Corpus ID: 9314136
A fuzzy algorithm is an ordered set of fuzzy instructions that upon execution yield an approximate solution to a given problem…
Review
1998
Distance Learning
Lisa Gualtieri
CHI Conference Summary
Corpus ID: 31417973
This tutorial covers how to design and deliver a distance learning class. The motivation for distance learning programs is…
1993
Structurally reinforced macrocyclic ligands
R. D. Hancock, G. Pattrick, P. W. Wade, G. D. Hosken
Corpus ID: 53445769
Highly Cited
1991
CALIFORNIA BEARING RATIO IMPROVEMENT OF REMOLDED SOILS BY THE ADDITION OF POLYPROPYLENE FIBER REINFORCEMENT
W., K. Humphries
Corpus ID: 107836678
The California bearing ratio (CBR) of a micaceous silt, common to the Piedmont in the southeastern United States, was…
1963
SHARING IN PRESCHOOL CHILDREN AS A FUNCTION OF AMOUNT AND TYPE OF REINFORCEMENT.
Fischer Wf
Corpus ID: 148122345