Optimizing Online Advertising with Multi-Armed Bandits: Mitigating the Cold Start Problem under Auction Dynamics

📅 2025-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Online advertising faces dual challenges: cold-start for new ads (inaccurate CTR estimation due to insufficient click data) and chronic CTR underestimation for mature ads (causing long-term revenue loss from early performance bias). Method: We propose a UCB-type multi-armed bandit algorithm tailored to the Position-Based Model (PBM) and integrated with the pay-per-click (PPC) auction mechanism. Contribution/Results: This is the first work to derive a theoretically provable upper bound on budgeted regret within the PBM framework. Our algorithm features a controllable exploration–exploitation trade-off, optimizing long-term platform revenue while satisfying short-term income constraints. Experiments on synthetic data and real-world ad logs demonstrate significant improvements: enhanced exposure efficiency and CTR estimation accuracy for new ads; 12.7% higher long-term revenue; and a 34% reduction in short-term income volatility.

Technology Category

Application Category

📝 Abstract
Online advertising platforms often face a common challenge: the cold start problem. Insufficient behavioral data (clicks) makes accurate click-through rate (CTR) forecasting of new ads challenging. CTR for"old"items can also be significantly underestimated due to their early performance influencing their long-term behavior on the platform. The cold start problem has far-reaching implications for businesses, including missed long-term revenue opportunities. To mitigate this issue, we developed a UCB-like algorithm under multi-armed bandit (MAB) setting for positional-based model (PBM), specifically tailored to auction pay-per-click systems. Our proposed algorithm successfully combines theory and practice: we obtain theoretical upper estimates of budget regret, and conduct a series of experiments on synthetic and real-world data that confirm the applicability of the method on the real platform. In addition to increasing the platform's long-term profitability, we also propose a mechanism for maintaining short-term profits through controlled exploration and exploitation of items.
Problem

Research questions and friction points this paper is trying to address.

Address cold start in online advertising
Improve CTR forecasting accuracy
Balance exploration and exploitation in auctions
Innovation

Methods, ideas, or system contributions that make the work stand out.

UCB-like algorithm application
Multi-armed bandit framework
Positional-based model optimization
🔎 Similar Papers
No similar papers found.
A
Anastasiia Soboleva
Avito, MSU Institute for Artificial Intelligence, Moscow Institute of Physics and Technology
Andrey Pudovikov
Andrey Pudovikov
Moscow State University
Algorithmic Game TheoryAuto-BiddingOnline Optimization
R
Roman Snetkov
Avito
A
Alina Babenko
Avito
Egor Samosvat
Egor Samosvat
Yandex
Yuriy Dorn
Yuriy Dorn
Moscow State University
Оптимизациятеория игр