Reinforcement Learning in Multi-Party Trading Dialog

Elnaz Nouri, David Traum, Satoshi Nakamura, Kallirroi Georgila, Takuya Hiraoka

doi:10.18653/v1/w15-4605

【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
Regarding the recording of “Research Data” and “Evidence Data”

Reinforcement Learning in Multi-Party Trading Dialog

DOI Open Access

Description

In this paper, we apply reinforcement learning (RL) to a multi-party trading scenario where the dialog system (learner) trades with one, two, or three other agents. We experiment with different RL algorithms and reward functions. The negotiation strategy of the learner is learned through simulated dialog with trader simulators. In our experiments, we evaluate how the performance of the learner varies depending on the RL algorithm used and the number of traders. Our results show that (1) even in simple multi-party trading dialog tasks, learning an effective negotiation policy is a very hard problem; and (2) the use of neural fitted Q iteration combined with an incremental reward function produces negotiation policies as effective or even better than the policies of two strong hand-crafted baselines.

Journal

Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue

Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue 32-41, 2015-01-01

Association for Computational Linguistics (ACL)

Details 詳細情報について

CRID

1871146592959463552
DOI

10.18653/v1/w15-4605
Data Source
- OpenAIRE

Reinforcement Learning in Multi-Party Trading Dialog

Description

Journal

Details 詳細情報について

Export

Report a problem