Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks

@article{PrashanthL2014TwoTC,
  title={Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks},
  author={A. L. PrashanthL. and Abhranil Chatterjee and Shalabh Bhatnagar},
  journal={Wireless Networks},
  year={2014},
  volume={20},
  pages={2589-2604}
}
In this paper, we consider an intrusion detection application for Wireless Sensor Networks. We study the problem of scheduling the sleep times of the individual sensors, where the objective is to maximize the network lifetime while keeping the tracking error to a minimum. We formulate this problem as a partially-observable Markov decision process (POMDP) with continuous stateaction spaces, in a manner similar to Fuemmeler and Veeravalli (IEEE Trans Signal Process 56(5), 2091–2101, 2008… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-10 of 31 references

A new Q-learning algorithm with linear function approximation. Technical report SSL, IISc, URL http://stochastic.csa.iisc.ernet.in/www/research/ files/IISc-CSA-SSL-TR-2012-3.pdf

  • S. Bhatnagar, K. Lakshmanan
  • 2012
Highly Influential
3 Excerpts

Stochastic recur- sive algorithms for optimization (Vol. 434)

  • S. Bhatnagar, H. Prasad, L. Prashanth
  • 2013
1 Excerpt

Similar Papers

Loading similar papers…