Abstract
We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g., multimedia data) over a fading channel. We propose a rigorous and unified framework for simultaneously utilizing both physical-layer and system-level techniques to minimize energy consumption, under delay constraints, in the presence of stochastic and unknown traffic and channel conditions. We formulate the problem as a Markov decision process and solve it online using reinforcement learning. The advantages of the proposed online method are that i) it does not require a priori knowledge of the traffic arrival and channel statistics to determine the jointly optimal physical-layer and system-level power management strategies; ii) it exploits partial information about the system so that less information needs to be learned than when using conventional reinforcement learning algorithms; and iii) it obviates the need for action exploration, which severely limits the adaptation speed and run-time performance of conventional reinforcement learning algorithms.
| Original language | English |
|---|---|
| Article number | 5986747 |
| Pages (from-to) | 6262-6266 |
| Number of pages | 5 |
| Journal | IEEE Transactions on Signal Processing |
| Volume | 59 |
| Issue number | 12 |
| DOIs | |
| State | Published - Dec 2011 |
Fingerprint
Dive into the research topics of 'Fast reinforcement learning for energy-efficient wireless communication'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver