Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 224,872,369 papers from all fields of science
Search
Sign In
Create Free Account
Q-learning
Known as:
Q learning
Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
15 relations
Agent-based computational economics
Artificial neural network
Cognitive architecture
Deep learning
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2018
2018
Air-Combat Strategy Using Deep Q-Learning
Xiaoteng Ma
,
Li Xia
,
Qianchuan Zhao
ACM Cloud and Autonomic Computing Conference
2018
Corpus ID: 59232612
Unmanned aircraft systems (UAS) are essential components in the future air-combat. Due to high dynamics and randomness of the…
Expand
2016
2016
Effective service composition using multi-agent reinforcement learning
Hongbing Wang
,
Xiaojun Wang
,
Xingzhi Zhang
,
Qi Yu
,
Xingguo Hu
Knowledge-Based Systems
2016
Corpus ID: 37868435
Highly Cited
2014
Highly Cited
2014
Design and Optimization of a (FA)Q-Learning-based HTTP Adaptive Streaming Client
Maxim Claeys
,
Steven Latré
,
J. Famaey
,
Tingyao Wu
,
W. V. Leekwijck
,
F. Turck
2014
Corpus ID: 9068489
,
Highly Cited
2011
Highly Cited
2011
Q-learning based congestion-aware routing algorithm for on-chip network
F. Farahnakian
,
M. Ebrahimi
,
M. Daneshtalab
,
P. Liljeberg
,
J. Plosila
IEEE International Conference on Networked…
2011
Corpus ID: 16073613
Network congestion can limit performance of NoC due to increased transmission latency and power consumption. Congestion-aware…
Expand
Highly Cited
2008
Highly Cited
2008
A quality of service negotiation-based vertical handoff decision scheme in heterogeneous wireless systems
Qingyang Song
,
A. Jamalipour
European Journal of Operational Research
2008
Corpus ID: 2976978
2008
2008
Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning
D. Katic
,
A. Rodic
,
M. Vukobratovic
J. Intell. Robotic Syst.
2008
Corpus ID: 7690795
In this paper, hybrid integrated dynamic control algorithm for humanoid locomotion mechanism is presented. The proposed structure…
Expand
Highly Cited
2004
Highly Cited
2004
Organization-based cooperative coalition formation
Sherief Abdallah
,
V. Lesser
ACM International Conference on International…
2004
Corpus ID: 1591796
The coalition formation problem has received a considerable amount of attention in recent years. In this work we present a novel…
Expand
2002
2002
Cooperation in multi-agent bidding
D. Wu
,
Yanjun Sun
Decision Support Systems
2002
Corpus ID: 38951072
Review
1999
Review
1999
State of XCS Classifier System Research
Stewart W. Wilson
Learning Classifier Systems
1999
Corpus ID: 14976846
XCS is a new kind of learning classifier system that differs from the traditional kind primarily in its definition of classifier…
Expand
Highly Cited
1998
Highly Cited
1998
ASGA: Improving the Ant System by Integration with Genetic Algorithms
A. White
,
B. Pagurek
,
F. Oppacher
1998
Corpus ID: 11155258
1 Email: oppacher@scs.carleton.ca ABSTRACT This paper describes how the Ant System can be improved by selfadaptation of its…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE