标题:Approximate Dynamic Programming via Penalty Functions * * This research was partially funded by the European Commission under the project Local4Global.
摘要:AbstractIn this paper, we propose a novel formulation for encoding state constraints into the Linear Programming approach to Approximate Dynamic Programming via the use of penalty functions. To maintain tractability of the resulting optimization problem that needs to be solved, we suggest a penalty function that is constructed as a point-wise maximum taken over a family of low-order polynomials. Once the penalty functions are designed, no additional approximations are introduced by the proposed formulation. The effectiveness and numerical stability of the formulation is demonstrated through examples.
关键词:KeywordsStochastic optimal control problemsApproximate dynamic programmingSoft constraints