Logo image
Continuous adaptive critic designs
Conference paper   Open access

Continuous adaptive critic designs

T. Hanselmann, L. Noakes and A. Zaknich
Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005., Vol.5, pp.3001-3006
IEEE
International Joint Conference on Neural Networks, IJCNN 2005 (Montreal, Canada, 31/07/2005–04/08/2005)
2005
pdf
continuous_adaptive_critic_designs.pdfDownloadView
Published (Version of Record) Open Access
url
Link to Published Version *Subscription may be requiredView

Abstract

A continuous formulation of an adaptive critic design (ACD) is investigated. Connections to the discrete case are made, where backpropagation through time (BPTT) and realtime recurrent learning (RTRL) are prevalent. A second order actor adaptation, based on Newton's method, is established for fast actor convergence. Also, a fast critic update for concurrent actor-critic training is outlined that keeps the Bellman optimality correct to first order approximation after actor changes.

Details

Metrics

244 File views/ downloads
161 Record Views
Logo image