The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes (2010)

Authors:
- Costa, Oswaldo Luiz do Valle
- Dufour, F
Autor USP: COSTA, OSWALDO LUIZ DO VALLE - EP
Unidade: EP
DOI: 10.1007/s00245-010-9099-4
Assunto: CADEIAS DE MARKOV
Language: Inglês
Abstract: The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP’s) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form
Imprenta:
- Publisher place: New York
- Date published: 2010
Source:
- Título do periódico: Applied Mathematics and Optimization
- Volume/Número/Paginação/Ano: p. 1-19, 03 mar. 2010

Informações sobre o DOI: 10.1007/s00245-010-9099-4 (Fonte: oaDOI API)

A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas

ABNT

COSTA, Oswaldo Luiz do Valle e DUFOUR, F. The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes. Applied Mathematics and Optimization, p. 1-19, 2010Tradução . . Disponível em: https://doi.org/10.1007/s00245-010-9099-4. Acesso em: 24 abr. 2024.
APA

Costa, O. L. do V., & Dufour, F. (2010). The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes. Applied Mathematics and Optimization, 1-19. doi:10.1007/s00245-010-9099-4
NLM

Costa OL do V, Dufour F. The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes [Internet]. Applied Mathematics and Optimization. 2010 ; 1-19.[citado 2024 abr. 24 ] Available from: https://doi.org/10.1007/s00245-010-9099-4
Vancouver

Costa OL do V, Dufour F. The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes [Internet]. Applied Mathematics and Optimization. 2010 ; 1-19.[citado 2024 abr. 24 ] Available from: https://doi.org/10.1007/s00245-010-9099-4