2020
DOI: 10.3906/elk-1907-215

Deep reinforcement learning for acceptance strategy in bilateral negotiations

Abstract: This paper introduces an acceptance strategy based on reinforcement learning for automated bilateral negotiation, where negotiating agents bargain on multiple issues in a variety of negotiation scenarios. Several acceptance strategies based on predefined rules have been introduced in the automated negotiation literature. Those rules mostly rely on some heuristics, which take time and/or utility into account. For some negotiation settings, an acceptance strategy solely based on a negotiation deadline might perf…
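As a point of reference for the rule-based strategies the abstract mentions, the sketch below shows a typical time- and utility-dependent acceptance heuristic. The function name, concession exponent, and reservation value are illustrative assumptions, not the paper's learned strategy.

```python
# Minimal sketch of a predefined acceptance rule that combines time and utility.
# All names and parameter values are illustrative assumptions.

def heuristic_accept(offer_utility: float, t: float,
                     reservation: float = 0.4, concession: float = 2.0) -> bool:
    """Accept the opponent's offer if it beats a time-dependent threshold.

    t is the normalized negotiation time in [0, 1]; the threshold concedes
    from 1.0 toward the reservation utility as the deadline approaches.
    """
    threshold = reservation + (1.0 - reservation) * (1.0 - t) ** concession
    return offer_utility >= threshold

# Early in the session a mediocre offer is rejected; near the deadline it passes.
print(heuristic_accept(0.55, t=0.10))  # False (threshold ~0.89)
print(heuristic_accept(0.55, t=0.95))  # True  (threshold ~0.40)
```

A learned acceptance strategy, by contrast, replaces the hand-tuned threshold schedule with a policy trained from negotiation outcomes.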

Cited by 17 publications (18 citation statements)
References 21 publications
“…Existing approaches with reinforcement learning have focused on methods such as Tabular Q-learning for bidding [12] and finding the optimal concession [34,35] or DQN for bid acceptance [27], which are not optimal for continuous action spaces. Such spaces, however, are the main focus in this work in order to estimate the threshold target utility value below which no bid is accepted/proposed from/to the opponent agent.…”
Section: Related Work (mentioning)
confidence: 99%
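To make the continuous-action idea in this statement concrete, here is a minimal sketch of a policy network that outputs a target-utility threshold in [0, 1], below which no bid is accepted or proposed. The state features, layer sizes, and names are assumptions for illustration, not the citing work's implementation.

```python
# Sketch of a continuous-action threshold policy (e.g. trainable with an
# actor-critic method such as DDPG). Names and architecture are assumptions.
import torch
import torch.nn as nn

class ThresholdPolicy(nn.Module):
    """Maps a negotiation state to a target-utility threshold in (0, 1)."""
    def __init__(self, state_dim: int = 3, hidden: int = 32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1), nn.Sigmoid(),  # squashes the output into (0, 1)
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def acceptable(policy: ThresholdPolicy, state: torch.Tensor, bid_utility: float) -> bool:
    """A bid is acceptable (or proposable) only above the learned threshold."""
    with torch.no_grad():
        threshold = policy(state).item()
    return bid_utility >= threshold
```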
“…(b) Meta-heuristic (or evolutionary) methods - work well across domains and improve iteratively using a fitness function (as a guide for quality); however, in these approaches every time an agent decision is made, this needs to be delivered by the meta-heuristic, which is not efficient and does not result in a human-interpretable and reusable negotiation strategy. (c) Machine learning algorithms - they show the best results with respect to run-time adaptability [8,27], but often their working hypotheses are not interpretable, a fact that may hinder their eventual adoption by users due to lack of transparency in the decision-making that they offer. (d) Interpretable strategy templates - developed in [10] to guide the use of a series of tactics whose optimal use can be learned during negotiation.…”
Section: Introduction (mentioning)
confidence: 99%
“…This is not, however, an ideal solution for large state/action spaces, as it may lead to the curse of dimensionality, as well as cause the loss of relevant information about the state/action domain structure. Razeghi et al. [43] use Deep Q Networks [36] to design a learnable acceptance strategy based on feedback received from the environment. The main difference of the above Q-learning approaches, when compared to ours, is that they cannot be used in continuous action spaces [31], and thus are inappropriate for our setting.…”
Section: Related Work (mentioning)
confidence: 99%
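The DQN-based acceptance decision described in this statement can be pictured roughly as follows. This is a sketch under assumed state features (normalized time, utility of the opponent's last offer, utility of the agent's own next bid) and an assumed network size, not the architecture of the cited paper; the training loop that uses the environment feedback is omitted.

```python
# Sketch of a DQN-style acceptance decision: a Q-network scores the two
# discrete actions {reject, accept}. State features and sizes are assumptions.
import torch
import torch.nn as nn

class AcceptanceQNet(nn.Module):
    def __init__(self, state_dim: int = 3, hidden: int = 32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 2),  # Q-values for [reject, accept]
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def should_accept(qnet: AcceptanceQNet, t: float,
                  opponent_offer_utility: float, own_next_bid_utility: float) -> bool:
    """Greedy (no-exploration) acceptance decision from the learned Q-values."""
    state = torch.tensor([[t, opponent_offer_utility, own_next_bid_utility]],
                         dtype=torch.float32)
    with torch.no_grad():
        q_values = qnet(state)
    return bool(q_values.argmax(dim=1).item() == 1)
```

During training, the Q-values would be updated from the feedback the statement mentions, for example the utility of the final agreement or a penalty for reaching the deadline without one.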
“…Then again, in the last couple of decades several studies have looked at the application of reinforcement learning (RL) algorithms such as Q-learning [17,20,46,48,49] and REINFORCE [47] in automated negotiation. Recently, deep reinforcement learning (DRL) has been used to learn target utility values [16], the acceptance strategy [43], or both the bidding and acceptance strategies [19]. Moreover, the authors of [15] have also shown the application of DRL in concurrent bilateral negotiation.…”
Section: RL in Autonomous Negotiation (mentioning)
confidence: 99%