🤖 AI Summary
For costly black-box optimization problems requiring human expert pairwise comparisons, existing preference-based Bayesian optimization (PBO) methods neglect both the computational cost of candidate solution generation and the perceptual uncertainty inherent in human evaluators—specifically, the just-noticeable difference (JND) threshold. This paper introduces a continuous-preference Bayesian optimization framework that, for the first time, jointly and explicitly models generation cost and perceptual uncertainty. We propose an information-driven sampling strategy that integrates the JND threshold with expected information gain, grounded in a probabilistic preference model. The method operates under realistic constraints, including indifference intervals in feedback and high generation costs, significantly improving convergence accuracy and sample efficiency. Experiments demonstrate that, under identical evaluation budgets, our approach achieves an average 23.6% improvement in accuracy over baseline PBO methods.
📝 Abstract
Preferential Bayesian optimization allows optimization of objectives that are either expensive or difficult to measure directly, by relying on a minimal number of comparative evaluations done by a human expert. Generating candidate solutions for evaluation is also often expensive, but this cost is ignored by existing methods. We generalize preference-based optimization to explicitly account for production and evaluation costs with Consecutive Preferential Bayesian Optimization, reducing production cost by constraining comparisons to involve previously generated candidates. We also account for the perceptual ambiguity of the oracle providing the feedback by incorporating a Just-Noticeable Difference threshold into a probabilistic preference model to capture indifference to small utility differences. We adapt an information-theoretic acquisition strategy to this setting, selecting new configurations that are most informative about the unknown optimum under a preference model accounting for the perceptual ambiguity. We empirically demonstrate a notable increase in accuracy in setups with high production costs or with indifference feedback.