TY - GEN
T1 - Online layered learning for cross-layer optimization of dynamic multimedia systems
AU - Mastronarde, Nicholas
AU - Van Der Schaar, Mihaela
PY - 2010/2/22
Y1 - 2010/2/22
N2 - In our recent work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics (e.g. the application's rate-distortion-complexity behavior) were known a priori, by modeling the system as a layered Markov decision process. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the cross-layer optimization in a centralized manner, and the second solves it in a decentralized manner. We analyze both algorithms in terms of their required computation, memory, and inter-layer communication overheads. In our experiments, we demonstrate that decentralized learning can perform equally as well as centralized learning, while enabling the layers to act autonomously. Additionally, we show that existing myopic learning algorithms deployed in multimedia systems perform significantly worse than our proposed foresighted learning methods.
AB - In our recent work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics (e.g. the application's rate-distortion-complexity behavior) were known a priori, by modeling the system as a layered Markov decision process. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the cross-layer optimization in a centralized manner, and the second solves it in a decentralized manner. We analyze both algorithms in terms of their required computation, memory, and inter-layer communication overheads. In our experiments, we demonstrate that decentralized learning can perform equally as well as centralized learning, while enabling the layers to act autonomously. Additionally, we show that existing myopic learning algorithms deployed in multimedia systems perform significantly worse than our proposed foresighted learning methods.
KW - Cross-layer adaptation to support real-time requirements
KW - Cross-layer multimedia system design
KW - Foresighted decision making
KW - Layered reinforcement learning
UR - https://www.scopus.com/pages/publications/77951259655
U2 - 10.1145/1730836.1730843
DO - 10.1145/1730836.1730843
M3 - Conference contribution
AN - SCOPUS:77951259655
SN - 9781605589145
T3 - MMSys'10 - Proceedings of the 2010 ACM SIGMM Conference on Multimedia Systems
SP - 47
EP - 58
BT - MMSys'10 - Proceedings of the 2010 ACM SIGMM Conference on Multimedia Systems
PB - Association for Computing Machinery
T2 - 1st Annual ACM SIGMM Conference on Multimedia Systems, MMSys 2010
Y2 - 22 February 2010 through 23 February 2010
ER -