B空间拥挤:校准LoRA合并中的共享方向
阅读原文· arxiv.orgLoRA合并性能下降的主因是输出矩阵B在不同任务间过度重复使用共享方向,而矩阵A更具任务特异性。本文提出无数据方法Pico,在合并前对B进行预校准,通过降低过度共享方向权重并重新缩放合并更新来减少干扰。该方法可直接集成至Task Arithmetic等现有流程,在数学、编程等8个基准测试中平均准确率提升3.4-8.3个百分点,整体性能最优,甚至超越使用全量数据联合训练的LoRA。
Merging separately trained LoRA adapters is a practical alternative to joint multi-task training, but it often hurts performance. Existing methods usually treat the LoRA update ΔW = BA as a single object and do not distinguish the two LoRA matrices. We show that the main source of LoRA merge interference comes from the output-side matrix B. Across tasks, B repeatedly uses a small set of shared directions, while A remains much more task-specific. As a result, the merged adapter overemphasizes these shared directions, and task-specific information is lost. We propose Pico (Pre-merge interference calibration in output-space), a data-free method that calibrates B before merge by downscaling over-shared directions and then rescaling the merged update. Pico plugs directly into existing merging methods such as Task Arithmetic, TIES, and TSV-M. Across eight different benchmarks from math, coding, finance, and medical domains, Pico improves average accuracy by 3.4-8.3 points over the corresponding base method and achieves the best overall average performance. Pico also enables merged adapters to outperform the LoRA trained with all task data. These results show that LoRA merging works better when the two LoRA matrices are treated separately.