Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 229,134,687 papers from all fields of science
Search
Sign In
Create Free Account
Reinforcement learning
Known as:
RL
, Actor critic architecture
, Reward function
Expand
Reinforcement learning is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
50 relations
AIXI
Action selection
Andrew Barto
Anticipation (artificial intelligence)
Expand
Broader (1)
Belief revision
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2018
Highly Cited
2018
Attention Solves Your TSP
W. Kool
,
M. Welling
arXiv.org
2018
Corpus ID: 88524343
We propose a framework for solving combinatorial optimization problems of which the output can be represented as a sequence of…
Expand
Highly Cited
2017
Highly Cited
2017
Near-Optimal Allocation Algorithms for Location-Dependent Tasks in Crowdsensing
Shibo He
,
Dong-Hoon Shin
,
Junshan Zhang
,
Jiming Chen
IEEE Transactions on Vehicular Technology
2017
Corpus ID: 32704838
Crowdsensing offers an efficient way to meet the demand in large-scale sensing applications. In crowdsensing, optimal task…
Expand
Highly Cited
2014
Highly Cited
2014
Design and Optimization of a (FA)Q-Learning-based HTTP Adaptive Streaming Client
Maxim Claeys
,
Steven Latré
,
J. Famaey
,
Tingyao Wu
,
W. V. Leekwijck
,
F. Turck
2014
Corpus ID: 9068489
,
Highly Cited
2008
Highly Cited
2008
Publication IV
P. Alku
2008
Corpus ID: 51740295
A large sample of vowels produced by male and female speakers were inverse filtered and parameterized using 21 different glottal…
Expand
Highly Cited
2008
Highly Cited
2008
A Distributed and Autonomic Virtual Network Mapping Framework
I. Houidi
,
Wajdi Louati
,
D. Zeghlache
International Conference on Autonomic and…
2008
Corpus ID: 7411565
This paper addresses the challenge of assigning virtual networks, through network virtualisation, to the underlying physical…
Expand
2007
2007
The design and evaluation of an intelligent sales agent for online persuasion and negotiation
Shiu-li Huang
,
Fu-Ren Lin
Electronic Commerce Research and Applications
2007
Corpus ID: 2930859
Highly Cited
2003
Highly Cited
2003
Evolution of behaviors in autonomous robot using artificial neural network and genetic algorithm
Malrey Lee
Information Sciences
2003
Corpus ID: 17709385
Highly Cited
1999
Highly Cited
1999
Beginning Elementary School Teachers and the Effective Teaching of Science
I. Ginns
,
J. Watters
1999
Corpus ID: 54743850
Many factors influence the teaching of science by beginning teachers in elementary schools. They have to confront a myriad of…
Expand
Highly Cited
1998
Highly Cited
1998
Maximizing sets and fuzzy Markoff algorithms
L. Zadeh
IEEE Trans. Syst. Man Cybern. Part C
1998
Corpus ID: 9314136
A fuzzy algorithm is an ordered set of fuzzy instructions that upon execution yield an approximate solution to a given problem…
Expand
Highly Cited
1991
Highly Cited
1991
CALIFORNIA BEARING RATIO IMPROVEMENT OF REMOLDED SOILS BY THE ADDITION OF POLYPROPYLENE FIBER REINFORCEMENT
W.
,
K. Humphries
1991
Corpus ID: 107836678
The California bearing ratio (CBR) of a micaceous silt, common to the Piedmont in the southeastern United States, was…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE