Adaptive strategy templates using deep reinforcement learning for multi-issue bilateral negotiation

Pallavi Bagga, Nicola Paoletti, Kostas Stathis*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Downloads (Pure)

Abstract

Negotiating in uncertain environments, where user preferences are only partially known, poses a challenge for traditional negotiation models that rely on rigid, pre-defined strategies. These models struggle to adapt to changing conditions or transfer knowledge across different negotiation contexts, making them ineffective in dynamic environments. To address this research gap, we propose a novel negotiation model that uses deep reinforcement learning (DRL) to enable agents learn adaptable, generalizable strategies through the notion of “strategy templates”. These templates include (a) choice parameters to select tactics, (b) time parameters to control when tactics are activated, and (c) attribute-value parameters to guide acceptance and inform bidding decisions. As a result, we enable negotiation agents dynamically adapt their strategies, through pre-training on teacher strategies and refining them via online learning in diverse environments. Our agents also derive a user model to approximate partially specified user preferences, thus handling preference uncertainty more effectively. We developed a proof-of-concept prototype using an actor-critic architecture based on DRL, supplemented by stochastic search techniques for the estimation of user model and multi-objective optimization for making mutually beneficial offers. Experimental evaluations show that our model outperforms state-of-the-art approaches in terms of both individual and social-welfare utilities, demonstrating its ability to transfer experience across domains and excel in previously unseen scenarios. This work provides a robust framework for dynamic, adaptable strategy formation, bridging the gap in current negotiation models by addressing uncertainty in user preferences and strategy flexibility.

Original languageEnglish
Article number129381
JournalNEUROCOMPUTING
Volume623
Early online date20 Jan 2025
DOIs
Publication statusPublished - 28 Mar 2025

Keywords

  • Deep reinforcement learning
  • Interpretable strategy templates
  • Multi-issue bilateral negotiation
  • User preference uncertainty

Fingerprint

Dive into the research topics of 'Adaptive strategy templates using deep reinforcement learning for multi-issue bilateral negotiation'. Together they form a unique fingerprint.

Cite this