A learning algorithm for risk-sensitive cost

Arnab Basu, Tirthankar Bhattacharyya, Vivek S. Borkar

Journal Name

Mathematics of Operations Research

Journal Publication

others

Publication Year

2008

Decision Sciences and Information Systems

Publication Date

Vol. 33 (4), pp. 880-898, 2008

Abstract

A linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the "o.d.e. method" for stochastic approximation. The scheme is also extended to continuous state space processes.
