面向应用对地观测的组合图像检索基准评测
阅读原文· arxiv.org论文建立了针对遥感组合图像检索(RSCIR)的统一基准评测框架。研究在PatternCom数据集上,系统评估了六种视觉语言骨干网络支持的代表性组合图像检索方法。同时,引入了一个名为xView2-CIR、以灾害和损毁监测为中心的新数据集。结果表明,无需训练的组合方法可作为遥感图像检索强健且可扩展的基线;而以变化为中心的检索任务,因需保持场景身份不变,带来了与基于属性检索不同的挑战。
Remote sensing composed image retrieval (RSCIR) enables search in large satellite image archives using composed queries that combine a reference image with a textual modifier. Although RSCIR offers a flexible interface for expressing targeted retrieval intent, the transferability of modern composition methods to Earth observation (EO) imagery and their relevance to operational EO workflows remain underexplored. We address this gap through a unified benchmark and an application-oriented study. First, we systematically adapt and evaluate representative composed image retrieval methods with six vision-language backbones on PatternCom under a standardized protocol, analyzing their behavior across backbones, composition strategies, and query types. Second, we introduce xView2-CIR, a change-centric dataset for disaster and damage monitoring, where retrieval is conditioned on scene identity and a target post-event state. Our results show that training-free composition methods provide strong and scalable baselines for EO retrieval, while change-centric retrieval presents different challenges from attribute-based retrieval, particularly due to the need to preserve scene identity. Overall, this study establishes a practical benchmark for RSCIR and positions composed retrieval as a complementary tool for remote sensing image retrieval, archive exploration, and change analysis. The dataset and code are available at https://github.com/billpsomas/rscir.