基于实例分割的像素级路面病害评估
阅读原文· arxiv.org本研究提出一种基于 Mask R-CNN 实例分割的视觉系统,用于精细化的路面病害分析,并在车辆采集的 UWGB-StreetCrack 数据集上进行评估。研究比较了五种基于 Detectron2 的骨干网络变体。其中,采用 ResNet-101 FPN 骨干的最佳 Mask R-CNN 模型,在项目特定的边界框匹配协议下,达到了 84.23% 的精确率、90.04% 的召回率和 87.04% 的 F1 分数。该模型预测的总体裂纹面积分数为 2.164%,与标注的真实值 2.170% 高度吻合。作为对比,一个基于 CSPDarknet53 的 YOLO 检测器性能显著较低。结果表明,实例分割是处理实地路面图像和估算裂纹面积的实用方向。
Automated pavement distress assessment requires more than image-level classification or coarse bounding box detection, demanding precise localization of thin, branching, and irregular cracks to achieve the geometric precision necessary for maintenance-relevant quantification. This paper presents a vision-based pavement distress analysis system based on Mask R-CNN instance segmentation and evaluates it on UWGB-StreetCrack, a custom field-collected roadway image dataset acquired with a vehicle-mounted smartphone and manually annotated with polygon labels for longitudinal cracks, transverse cracks, alligator cracks, and potholes. Five Detectron2-based Mask R-CNN backbone variants were considered under a consistent fine-tuning protocol. The best-performing model, Mask R-CNN with a ResNet-101 FPN backbone, achieved 84.23% precision, 90.04% recall, and an F1 score of 87.04% under the project-specific bounding-box matching protocol. The same model produced an aggregate predicted crack-area fraction of 2.164%, closely matching the 2.170% ground-truth crack-area fraction. To contextualize the segmentation system against a detector-oriented alternative, a CSPDarknet53-based YOLO detector was also adapted and retrained on the dataset, reaching 27.5% precision and 20.7% recall on the validation protocol. The results show that instance segmentation is a practical direction for field pavement imagery and aggregate crack-area estimation, while also exposing open challenges in annotation consistency, class imbalance, confounder rejection, and mask-level benchmarking.