Centres Of Excellence

To focus on new and emerging areas of research and education, Centres of Excellence have been established within the Institute. These ‘virtual' centres draw on resources from its stakeholders, and interact with them to enhance core competencies

Read More >>

Faculty

Faculty members at IIMB generate knowledge through cutting-edge research in all functional areas of management that would benefit public and private sector companies, and government and society in general.

Read More >>

IIMB Management Review

Journal of Indian Institute of Management Bangalore

IIM Bangalore offers Degree-Granting Programmes, a Diploma Programme, Certificate Programmes and Executive Education Programmes and specialised courses in areas such as entrepreneurship and public policy.

Read More >>

About IIMB

The Indian Institute of Management Bangalore (IIMB) believes in building leaders through holistic, transformative and innovative education

Read More >>

A learning algorithm for risk-sensitive cost

Arnab Basu, Tirthankar Bhattacharyya, Vivek S.Borkar
Journal Name
Mathematics of operations research
Journal Publication
others
Publication Year
2008
Journal Publications Functional Area
Decision Sciences and Information Systems
Publication Date
Vol. 33 (4), PP 880-898, 2008
Abstract

A linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the "o.d.e. method" for stochastic approximation. The scheme is also extended to continuous state space processes.

A learning algorithm for risk-sensitive cost

Author(s) Name: Arnab Basu, Tirthankar Bhattacharyya, Vivek S.Borkar
Journal Name: Mathematics of operations research
Volume: Vol. 33 (4), PP 880-898, 2008
Year of Publication: 2008
Abstract:

A linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the "o.d.e. method" for stochastic approximation. The scheme is also extended to continuous state space processes.