Institute of Information Theory and Automation

Bibliography

Research Report

Approximate Dynamic Programming based on High Dimensional Model Representation

Pištěk Miroslav

: ÚTIA AV ČR, v.v.i., (Prague 2011)

: Research Report 2310

: CEZ:AV0Z10750506

: GAP102/11/0437, GA ČR

: HDMR approximation, Bellman equation, minimization of HDMR functions

: http://library.utia.cas.cz/separaty/2012/AS/pistek-approximate dynamic programming based on high dimensional model representation.pdf

(eng): In this article, an efficient algorithm for approximating an optimal decision strategy is introduced. The proposed approximation of the Bellman equation is based on the HDMR technique. This non-parametric function approximation is used not only to reduce the memory needed to store the Bellman function, but also to allow its fast approximate minimization. On that account, a clear connection between HDMR minimization and discrete optimization is newly established. In each time step of the backward evaluation of the Bellman function, we relax the parameterized discrete minimization subproblem to obtain a parameterized trust-region problem. We observe that the matrix involved is the same for all parameters owing to the structure of the HDMR approximation. We compute the eigenvalue decomposition of this matrix to solve all trust-region problems efficiently.
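
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea behind its last two sentences, not the method of the report. It assumes each subproblem has the form: minimize 0.5*x'Ax + b'x subject to ||x|| <= r, where the symmetric matrix A is shared by all subproblems and only the vector b depends on the parameter. The function names, the bisection search on the Lagrange multiplier, and the tolerances are illustrative choices.

```python
import numpy as np


def _solve_in_eigenbasis(lam, c, radius, tol, max_iter):
    """Solve min 0.5*sum(lam*y**2) + c@y subject to ||y|| <= radius.

    `lam` holds the eigenvalues of A in ascending order and `c` is the
    linear term expressed in the eigenvector basis.
    """
    lam_min = lam[0]

    # Interior case: A is positive definite and the unconstrained
    # minimizer already lies inside the ball.
    if lam_min > 0:
        y = -c / lam
        if np.linalg.norm(y) <= radius:
            return y

    # "Hard case": c has no component in the eigenspace of lam_min, so the
    # multiplier equals -lam_min and the boundary is reached by moving
    # along an eigenvector of lam_min.
    flat = (lam - lam_min) <= 1e-12 * max(1.0, abs(lam_min))
    if lam_min <= 0 and np.allclose(c[flat], 0.0):
        y = np.zeros_like(c)
        y[~flat] = -c[~flat] / (lam[~flat] - lam_min)
        gap = radius**2 - y @ y
        if gap >= 0:
            y[0] += np.sqrt(gap)
            return y

    # Boundary case: find mu > max(0, -lam_min) with ||c / (lam + mu)||
    # equal to the radius. The norm decreases monotonically in mu, so
    # plain bisection is enough for a sketch.
    lo = max(0.0, -lam_min)
    hi = lo + np.linalg.norm(c) / radius  # norm at hi is <= radius
    for _ in range(max_iter):
        if hi - lo <= tol * max(1.0, hi):
            break
        mid = 0.5 * (lo + hi)
        if np.linalg.norm(c / (lam + mid)) > radius:
            lo = mid
        else:
            hi = mid
    return -c / (lam + hi)


def solve_trust_region_batch(A, B, radius, tol=1e-12, max_iter=200):
    """Solve one trust-region problem per column of B, all sharing A."""
    A = np.asarray(A, dtype=float)
    B = np.asarray(B, dtype=float)
    lam, Q = np.linalg.eigh(A)   # single decomposition, reused below
    C = Q.T @ B                  # every linear term in the eigenbasis
    X = np.empty_like(B)
    for j in range(B.shape[1]):
        y = _solve_in_eigenbasis(lam, C[:, j], radius, tol, max_iter)
        X[:, j] = Q @ y          # map back to the original coordinates
    return X
```

In this sketch the O(n^3) eigendecomposition is paid once, and each additional right-hand side costs only matrix-vector products plus a one-dimensional root search, which is the kind of saving a shared matrix makes possible.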

: BC

07.01.2019 - 08:39