Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 227,333,806 papers from all fields of science
Search
Sign In
Create Free Account
Reinforcement learning
Known as:
RL
, Actor critic architecture
, Reward function
Expand
Reinforcement learning is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
50 relations
AIXI
Action selection
Andrew Barto
Anticipation (artificial intelligence)
Expand
Broader (1)
Belief revision
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2017
Highly Cited
2017
Near-Optimal Allocation Algorithms for Location-Dependent Tasks in Crowdsensing
Shibo He
,
Dong-Hoon Shin
,
Junshan Zhang
,
Jiming Chen
IEEE Transactions on Vehicular Technology
2017
Corpus ID: 32704838
Crowdsensing offers an efficient way to meet the demand in large-scale sensing applications. In crowdsensing, optimal task…
Expand
2013
2013
When simplicity meets optimality: Efficient transmission power control with stochastic energy harvesting
Qingsi Wang
,
M. Liu
Proceedings IEEE INFOCOM
2013
Corpus ID: 15489558
We consider the optimal transmission power control of a single wireless node with stochastic energy harvesting and an infinite…
Expand
2009
2009
Covariant Policy Search
W. Liu
,
Sanjiang Li
,
Jochen Renz
International Joint Conference on Artificial…
2009
Corpus ID: 117131270
Increasing the expressiveness of qualitative spatial calculi is an essential step towards meeting the requirements of…
Expand
Highly Cited
2008
Highly Cited
2008
A Distributed and Autonomic Virtual Network Mapping Framework
I. Houidi
,
Wajdi Louati
,
D. Zeghlache
International Conference on Autonomic and…
2008
Corpus ID: 7411565
This paper addresses the challenge of assigning virtual networks, through network virtualisation, to the underlying physical…
Expand
2005
2005
Experimental behaviour and strength of concrete-encased composite beam–columns with T-shaped steel section under cyclic loading
Cheng-Chih Chen
,
Jian Ming Li
,
C. Weng
2005
Corpus ID: 56569681
Highly Cited
2005
Highly Cited
2005
A micro-simulation model system of departure time using a perception updating model under travel time uncertainty
D. Ettema
,
G. Tamminga
,
H. Timmermans
,
T. Arentze
2005
Corpus ID: 14276499
Highly Cited
2003
Highly Cited
2003
Evolution of behaviors in autonomous robot using artificial neural network and genetic algorithm
Malrey Lee
Information Sciences
2003
Corpus ID: 17709385
Highly Cited
1999
Highly Cited
1999
Beginning Elementary School Teachers and the Effective Teaching of Science
I. Ginns
,
J. Watters
1999
Corpus ID: 54743850
Many factors influence the teaching of science by beginning teachers in elementary schools. They have to confront a myriad of…
Expand
Highly Cited
1998
Highly Cited
1998
Maximizing sets and fuzzy Markoff algorithms
L. Zadeh
IEEE Trans. Syst. Man Cybern. Part C
1998
Corpus ID: 9314136
A fuzzy algorithm is an ordered set of fuzzy instructions that upon execution yield an approximate solution to a given problem…
Expand
Highly Cited
1996
Highly Cited
1996
EFFECT OF FRP REINFORCEMENT ON LOW GRADE EASTERN HEMLOCK GLULAMS
H. Dagher
,
T. Kimball
,
S. Shaler
,
B. Abdel-Magid
1996
Corpus ID: 15929725
The benefits of reinforcing glulam beams made with eastern hemlock, an under-utilized wood species in the state of Maine, are…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE