A regret lower bound for assortment optimization under the capacitated MNL model with arbitrary revenue parameters
| Authors | |
|---|---|
| Publication date | 10-2022 |
| Journal | Probability in the Engineering and Informational Sciences |
| Volume | Issue number | 36 | 4 |
| Pages (from-to) | 1266-1274 |
| Organisations |
|
| Abstract |
In this note, we consider dynamic assortment optimization with incomplete information under the capacitated multinomial logit choice model. Recently, it has been shown that the regret (the cumulative expected revenue loss caused by offering suboptimal assortments) that any decision policy endures is bounded from below by a constant times √NT, where Ndenotes the number of products and T denotes the time horizon. This result is shown under the assumption that the product revenues are constant, and thus leaves the question open whether a lower regret rate can be achieved for nonconstant revenue parameters. In this note, we show that this is not the case: we show that, for any vector of product revenues there is a positive constant such that the regret of any policy is bounded from below by this constant times √NT. Our result implies that policies that achieve O(√NT) regret are asymptotically optimal for all product revenue parameters.
|
| Document type | Article |
| Language | English |
| Published at | https://doi.org/10.1017/S0269964821000395 |
| Other links | https://www.scopus.com/pages/publications/85114320027 |
| Downloads |
A regret lower bound for assortment optimization under the capacitated MNL model
(Final published version)
|
| Permalink to this page | |