In this paper we propose a closed-form asymptotic expansion approach to pricing discretely monitored Asian options in general one-dimensional diffusion models. Our expansion is a small-time expansion because the expansion parameter is selected to be the square root of the length of monitoring interval. This expansion method is distinguished from many other pricing-oriented expansion algorithms in the literature due to two appealing features. First, we illustrate that it is possible to explicitly calculate not only the first several expansion terms but also any general expansion term in a systematic way. Second, the convergence of the expansion is proved rigorously under some regularity conditions. Numerical experiments suggest that the closed-form expansion formula with only a few terms (e.g., four terms up to the third order) is accurate, fast, and easy to implement for a broad range of diffusion models, even including those violating the regularity conditions.
We study a stylized dynamic assortment planning problem during a selling season of finite length T. At each time period, the seller offers an arriving customer an assortment of substitutable products and the customer makes the purchase among offered products according to a discrete choice model. The goal of the seller is to maximize the expected revenue, or equivalently, to minimize the worst‐case expected regret. One key challenge is that utilities of products are unknown to the seller and need to be learned. Although the dynamic assortment planning problem has received increasing attention in revenue management, most existing work is based on the multinomial logit choice models (MNL). In this paper, we study the problem of dynamic assortment planning under a more general choice model—the nested logit model, which models hierarchical choice behavior and is “the most widely used member of the GEV (generalized extreme value) family” (Train 2009). By leveraging the revenue‐ordered structure of the optimal assortment within each nest, we develop a novel upper confidence bound (UCB) policy with an aggregated estimation scheme. Our policy simultaneously learns customers’ choice behavior and makes dynamic decisions on assortments based on the current knowledge. It achieves the accumulated regret at the order of Ofalse~false(MNTfalse), where M is the number of nests and N is the number of products in each nest. We further provide a lower bound result of Ω(italicMT), which shows the near optimality of the upper bound when T is much larger than M and N. When the number of items per nest N is large, we further provide a discretization heuristic for better performance of our algorithm. Numerical results are presented to demonstrate the empirical performance of our proposed algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.