Optimal Semiparametric Dynamic Pricing with Feature Diversity

📅 2026-05-05
📈 Citations: 0
Influential: 0
📄 PDF

career value

167K/year
📝 Abstract
We study contextual dynamic pricing under a semiparametric demand model in which the purchase probability is $1-F(p-m(\mathbf{x}))$, where $m(\mathbf{x})$ captures mean utility as a function of product features and buyer covariates, and $F$ is an unknown market-noise distribution. Existing methods either incur suboptimal regret or rely on restrictive structural assumptions. We propose a stagewise greedy pricing algorithm that iteratively refines the estimate of $F$ via local polynomial regression while pricing greedily with current estimates. By exploiting feature diversity, the algorithm reuses endogenous samples collected during exploitation for nonparametric estimation, avoiding costly global random exploration used in prior work. We establish a general regret bound that applies to any estimator $\hat m$ of the utility function, and derive explicit rates for linear, nonparametric additive, and sparse linear classes of $m$. For the linear class, our regret scales as $T^{\max\{1/2,\,3/(2β+1)\}}$, where $β$ is the smoothness of $F$ and $T$ is the time horizon. This improves the best known rates for semiparametric contextual pricing and achieves the parametric $\sqrt{T}$ rate when $β\ge 5/2$. We further prove a matching lower bound, showing the optimality of our rate, and present numerical experiments that corroborate the theory and demonstrate the practical advantages of iterative refinement.
Problem

Research questions and friction points this paper is trying to address.

dynamic pricing
semiparametric demand
feature diversity
regret minimization
contextual pricing
Innovation

Methods, ideas, or system contributions that make the work stand out.

semiparametric dynamic pricing
feature diversity
local polynomial regression
regret optimality
endogenous sample reuse