TY - GEN
T1 - Reinforcement learning for energy-efficient wireless transmission
AU - Mastronarde, Nicholas
AU - Van Der Schaar, Mihaela
PY - 2011
Y1 - 2011
N2 - We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified framework for simultaneously utilizing both physical-layer centric and system-level techniques to minimize energy consumption, under delay constraints, in the presence of stochastic and unknown traffic and channel conditions. We formulate the problem as a Markov decision process and solve it online using reinforcement learning. The advantages of the proposed online method are that it exploits partial information about the system and it obviates the need for action exploration. Consequently, it significantly outperforms existing reinforcement learning solutions.
AB - We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified framework for simultaneously utilizing both physical-layer centric and system-level techniques to minimize energy consumption, under delay constraints, in the presence of stochastic and unknown traffic and channel conditions. We formulate the problem as a Markov decision process and solve it online using reinforcement learning. The advantages of the proposed online method are that it exploits partial information about the system and it obviates the need for action exploration. Consequently, it significantly outperforms existing reinforcement learning solutions.
KW - Energy-efficient wireless transmission
KW - Markov decision process
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/80051639157
U2 - 10.1109/ICASSP.2011.5947128
DO - 10.1109/ICASSP.2011.5947128
M3 - Conference contribution
AN - SCOPUS:80051639157
SN - 9781457705397
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 3452
EP - 3455
BT - 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
T2 - 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Y2 - 22 May 2011 through 27 May 2011
ER -