Near-Optimal Bisection Search for Nonparametric Dynamic Pricing with Inventory Constraint

Lei, Yanzhe; Jasin, Stefanus; Sinha, Amitabh

doi:10.2139/ssrn.2509425

Cited by 26 publications

(37 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Before proceeding, we introduce a modified stochastic system. Technically, if the inventory is depleted at t, then P m (t) must be switched to p ∞ , a choke price at which the demand of type-m customers is turned off, for all m. We use a similar simplification to Lei et al (2017) and consider a slightly different problem. When the inventory is depleted, instead of forced to set p ∞ for all types of customers, the firm can still use prices between [p, p].…”

Section: Discussionmentioning

confidence: 99%

“…Therefore, the firm has to balance the exploration/exploitation trade-off, which is usually referred to as the learning-andearning problem in this line of literature. Among them, our paper is related to those with nonparametric formulations and inventory constraints (Besbes and Zeevi, 2009;Wang et al, 2014;Lei et al, 2017). In addition, we consider personalized dynamic pricing for multiple types of customers, while most of the above papers consider a single type.…”

Section: Literature Reviewmentioning

confidence: 99%

See 1 more Smart Citation

A Primal-dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint

Chen

Gallego²

2018

SSRN Journal

View full text Add to dashboard Cite

A firm is selling a product to different types (based on the features such as education backgrounds, ages, etc.) of customers over a finite season with non-replenishable initial inventory. The type label of an arriving customer can be observed but the demand function associated with each type is initially unknown. The firm sets personalized prices dynamically for each type and attempts to maximize the revenue over the season. We provide a learning algorithm that is near-optimal when the demand and capacity scale in proportion. The algorithm utilizes the primal-dual formulation of the problem and learns the dual optimal solution explicitly. It allows the algorithm to overcome the curse of dimensionality (the rate of regret is independent of the number of types) and sheds light on novel algorithmic designs for learning problems with resource constraints.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Literature Reviewmentioning

confidence: 99%

A Primal-dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint

Chen

Gallego²

2018

SSRN Journal

View full text Add to dashboard Cite

show abstract

“…The interpretation here is that if the product is sold for v L (can be zero) all consumers will purchase for sure, and if the product is sold for v H , no consumers will buy. Consistent with the robust pricing literature [Bergemann and Schlag, 2008, Besbes and Zeevi, 2009, Handel et al, 2013, Wang and Hu, 2014, Handel and Misra, 2015, Lei et al, 2014 we assume within this range the firm does not know the distribution of consumer preferences across or within segments. Our motivation for this assumption is that it is infeasible for the manager to have credible priors for millions of products.…”

Section: Model Setup and Maintained Assumptionsmentioning

confidence: 98%

“…The firm has to ex-ante set the length of experimentation stage. More recent additions to this literature Wang and Hu [2014] and Lei et al [2014] improve the convergence results, yet the algorithms proposed in these paper also consider distinct phases for exploration then exploitation, or as we refer to it, "learning then earning." Instead, in our paper, we consider the learning and earning phases simultaneously, accounting for the potential value from learning at each point in time.…”

Section: Literature On Pricingmentioning

confidence: 99%

Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

2019

View full text Add to dashboard Cite

Consider the pricing decision for a manager at a large online retailer, that sells millions of products.A manager must decide on real-time prices for each of these products. It is infeasible to have complete knowledge of demand curve for each product. A manager can run price experiments to learn about demand and maximize long run profits. There are two aspects that make this setting different from traditional brick-and-mortar settings. First, due to the number of products the manager must be able to automate pricing. Second, an online retailer can make frequent price changes. In this paper, we propose a dynamic price experimentation policy where the firm has incomplete demand information.For this general setting, we derive a pricing algorithm that balances earning profit immediately and learning for future profits. The proposed approach combines multi-armed bandit (MAB) algorithms statistical machine learning with partial identification of consumer demand from economic theory. Our automated policy solves this problem using a scalable distribution-free algorithm. We show that our method converges to the optimal price faster than standard machine learning MAB solutions to the problem. In a series of Monte Carlo simulations, we show that the proposed approach perform favorably compared to methods in computer science and revenue management.

show abstract

“…In recent years, data-driven sequential decision-making has received a lot of attentions and finds a wide range of applications in operations management, such as dynamic inventory control (see, e.g., Huh et al (2011), Chen and Plambeck (2008), Chen et al (2019b,a), Lei et al (2019)), dynamic pricing (see, e.g., Zeevi (2009, 2015), Wang et al (2014), Chen et al (2019c), Broder and Rusmevichientong (2012)), dynamic assortment optimization (see, e.g., Rusmevichientong and Topaloglu (2012), Saure and Zeevi (2013), Agrawal et al (2019), Wang et al (2018), Chen et al (2018)). Take the personalized/contextual dynamic pricing as an example; it is usually assumed that the underlying demand, which is a function of the price and customer's contextual information, follows a certain probabilistic model with unknown parameters.…”

Section: Introductionmentioning

confidence: 99%

Uncertainty Quantification for Demand Prediction in Contextual Dynamic Pricing

Chen

2020

SSRN Journal

View full text Add to dashboard Cite

Data-driven sequential decision has found a wide range of applications in modern operations management, such as dynamic pricing, inventory control, and assortment optimization. Most existing research on datadriven sequential decision focuses on designing an online policy to maximize the revenue. However, the research on uncertainty quantification on the underlying true model function (e.g., demand function), a critical problem for practitioners, has not been well explored. In this paper, using the problem of demand function prediction in dynamic pricing as the motivating example, we study the problem of constructing accurate confidence intervals for the demand function. The main challenge is that sequentially collected data leads to significant distributional bias in the maximum likelihood estimator or the empirical risk minimization estimate, making classical statistics approaches such as the Wald's test no longer valid. We address this challenge by developing a debiased approach and provide the asymptotic normality guarantee of the debiased estimator. Based this the debiased estimator, we provide both point-wise and uniform confidence intervals of the demand function.

show abstract

Near-Optimal Bisection Search for Nonparametric Dynamic Pricing with Inventory Constraint

Cited by 26 publications

References 39 publications

A Primal-dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint

A Primal-dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint

Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Uncertainty Quantification for Demand Prediction in Contextual Dynamic Pricing

Contact Info

Product

Resources

About