Also on Google Scholar, ResearchGate, DBLP
(Hover over the image to magnify it)
Short-wave infrared (SWIR) imaging has emerged as a promising modality for autonomous driving, yet its practical benefits over RGB remain poorly characterized across diverse conditions. This paper presents a systematic comparative study of paired RGB and SWIR object detection on the RASMD dataset, covering four weather conditions and two real-time detection architectures, with various fine-tunings evaluated against a unified ground truth. Overall, RGB demonstrates comparable or superior performance in most scenarios, while RF-DETR exhibits greater robustness across varying conditions. Beyond aggregate metrics, we propose a sensor-dominance mining framework that combines multi-model agreement with targeted manual inspection to identify scenarios where one sensing modality provides more reliable detections using largely unannotated paired data. This analysis reveals that SWIR offers clear advantages in four safety-critical situations, including windshield glare, water droplets on the windshield, low-contrast object visibility, and long-range vehicle detection. The findings suggest that SWIR should be viewed as a complementary modality that enhances perception in rare but challenging conditions. The datasets will be available upon request, and all code and trained model weights will be publicly released at https://github.com/comsee-research/swir-adverse-env-analysis.
SWIR imaging represents a promising approach for enhancing road perception in adverse conditions, though it remains constrained by the scarcity of annotated data compared to models pre-trained in the visible spectrum. This paper presents a comparative study of adaptation strategies for YOLOv8 in the context of multi-class road infrared detection. We demonstrate that partial network freezing offers an efficiency-performance trade-off comparable to full fine-tuning, while highlighting that a structural performance gap consistently persists in favor of the visible domain. Finally, evaluating image synthesis through image-to-image translation and training on mixed datasets reveals effective strategies for mitigating the lack of real-world data, although these methods do not fully bridge the gap between modalities.
@InProceedings{Mehra_2025_ICCV,
author = {Mehra, Rohan and Riffard, Alexandre and Labussière, Mathieu and Duthon, Pierre and Aufrère, Romuald},
title = {Would SWIR modality help for detection and segmentation in harsh weather conditions? An experimental study.},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month = {October},
year = {2025},
pages = {2211-2219}
}
Find all my projects and experiences in CV!