上下文多实例学习
阅读原文· arxiv.org多实例学习(MIL)解决监督信号仅存在于包级别的问题,现有算法在低标注数据场景下表现不佳。本文提出在合成数据上预训练一个基于Perceiver架构的上下文学习器,能通过少量标注包解决新任务,推理时仅需单次前向传播,无需梯度更新。研究比较了多种包结构合成数据生成器,发现其互补的归纳偏置经混合预训练后能继承各自优势,在12个MIL基准测试上取得平均最佳性能,超越需要任务特定训练的监督基线。
Multiple Instance Learning (MIL) addresses problems where supervision is available at the level of bags of instances and has been successfully applied in fields ranging from computational pathology to satellite imagery. Nevertheless, existing algorithms struggle in the low-label regime that characterizes many real-world applications. Flexible models overfit and rigid ones fail to adapt to the task at hand. We show that pretraining an in-context learner with a Perceiver-style architecture on synthetic data yields a model that can solve new tasks from a handful of labeled bags. At inference time, classification happens in a single forward pass and requires no gradient updates. We propose and investigate different synthetic data generators for bag-structured data and find that they capture complementary inductive biases. A model pretrained on a mixture of these generators inherits their per-task strengths and achieves the best average performance across twelve MIL benchmarks, outperforming supervised baselines that require task-specific training.