LoopCTR：解锁循环扩展能力以优化点击率预测

2026-04-21 08:00·73天前

AI 摘要

LoopCTR提出一种循环扩展范式，通过递归重用共享模型层将训练计算与参数增长解耦，采用三明治架构结合超连接残差与混合专家，并在各循环深度实施过程监督。该方法实现"训练多循环、推理零循环"策略，单次前向传播即可超越所有基线。实验在三个公开基准及工业数据集上达到SOTA性能，Oracle分析揭示0.02-0.04 AUC的优化空间，且少循环训练模型展现出更高的自适应推理潜力。

原文 · 未翻译

Scaling Transformer-based click-through rate (CTR) models by stacking more parameters brings growing computational and storage overhead, creating a widening gap between scaling ambitions and the stringent industrial deployment constraints. We propose LoopCTR, which introduces a loop scaling paradigm that increases training-time computation through recursive reuse of shared model layers, decoupling computation from parameter growth. LoopCTR adopts a sandwich architecture enhanced with Hyper-Connected Residuals and Mixture-of-Experts, and employs process supervision at every loop depth to encode multi-loop benefits into the shared parameters. This enables a train-multi-loop, infer-zero-loop strategy where a single forward pass without any loop already outperforms all baselines. Experiments on three public benchmarks and one industrial dataset demonstrate state-of-the-art performance. Oracle analysis further reveals 0.02--0.04 AUC of untapped headroom, with models trained with fewer loops exhibiting higher oracle ceilings, pointing to a promising frontier for adaptive inference.

HuggingFace Daily Papers（社区热门论文）

导出 Markdown

LoopCTR：解锁循环扩展能力以优化点击率预测

2026-04-21 08:00·73天前

阅读原文· arxiv.org

AI 摘要

原文 · 保持原样，未翻译