Skip to main navigation Skip to search Skip to main content

Faster algorithm and sharper analysis for constrained Markov decision process

  • Tianjiao Li
  • , Ziwei Guan
  • , Shaofeng Zou
  • , Tengyu Xu
  • , Yingbin Liang
  • , Guanghui Lan
  • Georgia Institute of Technology
  • Ohio State University
  • Meta

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated reward subject to constraints on its utilities/costs. We propose a new primal-dual approach with a novel integration of entropy regularization and Nesterov's accelerated gradient method. The proposed approach is shown to converge to the global optimum with a complexity of O˜(1/ϵ) in terms of the optimality gap and the constraint violation, which improves the complexity of the existing primal-dual approaches by a factor of O(1/ϵ).

Original languageEnglish
Article number107107
JournalOperations Research Letters
Volume54
DOIs
StatePublished - May 2024

Keywords

  • Accelerated gradient method
  • Constrained Markov decision process
  • Entropy regularization
  • Policy optimization
  • Primal-dual algorithm

Fingerprint

Dive into the research topics of 'Faster algorithm and sharper analysis for constrained Markov decision process'. Together they form a unique fingerprint.

Cite this