Centres Of Excellence

To focus on new and emerging areas of research and education, Centres of Excellence have been established within the Institute. These ‘virtual' centres draw on resources from its stakeholders, and interact with them to enhance core competencies

Read More >>

Faculty

Faculty members at IIMB generate knowledge through cutting-edge research in all functional areas of management that would benefit public and private sector companies, and government and society in general.

Read More >>

IIMB Management Review

Journal of Indian Institute of Management Bangalore

IIM Bangalore offers Degree-Granting Programmes, a Diploma Programme, Certificate Programmes and Executive Education Programmes and specialised courses in areas such as entrepreneurship and public policy.

Read More >>

About IIMB

The Indian Institute of Management Bangalore (IIMB) believes in building leaders through holistic, transformative and innovative education

Read More >>

An actor–critic algorithm for multi-agent learning in queue-based stochastic games

D. Krishna Sundar and K Ravikumar
Journal Name
Neurocomputing
Journal Publication
others
Publication Year
2014
Journal Publications Functional Area
Organizational Behavior & Human Resources Management
Publication Date
Vol. 127, Issue 15, March 2014, Pg: 258-265
Abstract

We consider state-dependent pricing in a two-player service market stochastic game where state of the game and its transition dynamics are modeled using a semi-Markovian queue. We propose a multi-time scale actor-critic based reinforcement algorithm for multi-agent learning under self-play and provide experimental results on Nash convergence.

An actor–critic algorithm for multi-agent learning in queue-based stochastic games

Author(s) Name: D. Krishna Sundar and K Ravikumar
Journal Name: Neurocomputing
Volume: Vol. 127, Issue 15, March 2014, Pg: 258-265
Year of Publication: 2014
Abstract:

We consider state-dependent pricing in a two-player service market stochastic game where state of the game and its transition dynamics are modeled using a semi-Markovian queue. We propose a multi-time scale actor-critic based reinforcement algorithm for multi-agent learning under self-play and provide experimental results on Nash convergence.