Method

Difference-driven Cross-modal RGB-D Road Segmentation Network(with aug) [DCR-RoadSeg(aug)]
[Anonymous Submission]

Submitted on 27 Nov. 2025 10:15 by
[Anonymous Submission]

Running time:0.01 s
Environment:1 core @ 2.5 Ghz (C/C++)

Method Description:
This work proposes DCR-RoadSeg, a lightweight RGB-
D road segmentation network that jointly addresses
cross-modal feature fusion, high-level context
modeling, and data-level robustness. A Difference-
driven Road-aware Feature Rectification Module
(DR-FRM) exploits the consensus and disagreement
between RGB and depth to adaptively recalibrate
features in both channel and spatial dimensions,
while a Depthwise Large-kernel Pyramidal
Aggregation (DLPA) module efficiently enlarges the
receptive field of high-level features for complex
road geometries. In addition, a geometrically
consistent instance-level RGB-D augmentation
strategy based on cross-dataset copy–paste
enriches occlusion- and illumination-challenging
cases and enhances robustness on small-scale
datasets. Extensive experiments on benchmark road
datasets demonstrate that DCR-RoadSeg achieves
superior accuracy and robustness compared with
existing methods, while maintaining real-time
inference.
Parameters:
lr=1e-4
optimizer=AdamW
epochs=600
Latex Bibtex:

Evaluation in Bird's Eye View


Benchmark MaxF AP PRE REC FPR FNR
UM_ROAD 95.74 % 92.95 % 95.31 % 96.18 % 2.16 % 3.82 %
UMM_ROAD 96.97 % 95.35 % 96.97 % 96.97 % 3.33 % 3.03 %
UU_ROAD 95.65 % 92.40 % 95.33 % 95.97 % 1.53 % 4.03 %
URBAN_ROAD 96.26 % 93.58 % 96.08 % 96.43 % 2.17 % 3.57 %
This table as LaTeX

Behavior Evaluation


Benchmark PRE-20 F1-20 HR-20 PRE-30 F1-30 HR-30 PRE-40 F1-40 HR-40
This table as LaTeX

Road/Lane Detection

The following plots show precision/recall curves for the bird's eye view evaluation.



This figure as: png eps pdf

This figure as: png eps pdf

This figure as: png eps pdf

This figure as: png eps pdf

Distance-dependent Behavior Evaluation

The following plots show the F1 score/Precision/Hitrate with respect to the longitudinal distance which has been used for evaluation.


Visualization of Results

The following images illustrate the performance of the method qualitatively on a couple of test images. We first show results in the perspective image, followed by evaluation in bird's eye view. Here, red denotes false negatives, blue areas correspond to false positives and green represents true positives.



This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png

This figure as: png


eXTReMe Tracker