π€ AI Summary
In large-scale homepage recommendation, exposure bias introduces spurious negatives (non-clicks β disinterest) and spurious positives (clicks β genuine interest). Existing methods lack systematic analysis of ineffective exposures and typically address sampling or debiasing in isolation. This paper presents the first systematic modeling of ineffective exposure effects, proposing a user-intent-driven unified framework for joint sampling and debiasing. Specifically, it dynamically selects high-quality negative samples via intent-aware identification, and introduces a dual-debiasing module that jointly corrects exposure bias and click bias in an end-to-end manner. Deployed in Taobaoβs homepage marketing system, online A/B tests demonstrate significant improvements: UCTR increases notably, with Baiyibutie and Taobaomiaosha conversion rates rising by 35.4% and 14.5%, respectively.
π Abstract
Large-scale homepage recommendations face critical challenges from pseudo-negative samples caused by exposure bias, where non-clicks may indicate inattention rather than disinterest. Existing work lacks thorough analysis of invalid exposures and typically addresses isolated aspects (e.g., sampling strategies), overlooking the critical impact of pseudo-positive samples - such as homepage clicks merely to visit marketing portals. We propose a unified framework for large-scale homepage recommendation sampling and debiasing. Our framework consists of two key components: (1) a user intent-aware negative sampling module to filter invalid exposure samples, and (2) an intent-driven dual-debiasing module that jointly corrects exposure bias and click bias. Extensive online experiments on Taobao demonstrate the efficacy of our framework, achieving significant improvements in user click-through rates (UCTR) by 35.4% and 14.5% in two variants of the marketing block on the Taobao homepage, Baiyibutie and Taobaomiaosha.