Method

WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object Detection [WinMamba]
[Anonymous Submission]

Submitted on 9 Oct. 2025 11:57 by
[Anonymous Submission]

Running time:0.1 s
Environment:1 core @ 2.5 Ghz (Python)

Method Description:
WinMamba is a novel 3D feature-encoding backbone
for object detection, built by stacking WinMamba
blocks. To overcome the spatial information loss
from axis-aligned scanning, it introduces a
window-shift strategy and learnable positional
encoding to enrich contextual cues within the
linear state space. Furthermore, a window-scale-
adaptive module is incorporated to compensate for
voxel features across different resolutions,
thereby enhancing multi-scale representation. This
architecture enables the model to efficiently
capture long-range dependencies while maintaining
computational efficiency.
Parameters:
None
Latex Bibtex:
None

Detailed Results

Object detection and orientation estimation results. Results for object detection are given in terms of average precision (AP) and results for joint object detection and orientation estimation are provided in terms of average orientation similarity (AOS).


Benchmark Easy Moderate Hard
Car (Detection) 95.07 % 93.84 % 92.63 %
Car (Orientation) 95.05 % 93.73 % 92.46 %
Car (3D Detection) 89.72 % 82.97 % 79.79 %
Car (Bird's Eye View) 93.46 % 90.07 % 87.94 %
Pedestrian (Detection) 74.58 % 64.01 % 60.25 %
Pedestrian (Orientation) 72.03 % 60.86 % 56.99 %
Pedestrian (3D Detection) 56.10 % 48.04 % 45.62 %
Pedestrian (Bird's Eye View) 59.75 % 52.59 % 49.13 %
Cyclist (Detection) 92.39 % 84.23 % 78.40 %
Cyclist (Orientation) 92.26 % 83.95 % 78.04 %
Cyclist (3D Detection) 87.84 % 73.86 % 66.76 %
Cyclist (Bird's Eye View) 88.86 % 76.92 % 69.85 %
This table as LaTeX


2D object detection results.
This figure as: png eps txt gnuplot



Orientation estimation results.
This figure as: png eps txt gnuplot



3D object detection results.
This figure as: png eps txt gnuplot



Bird's eye view results.
This figure as: png eps txt gnuplot



2D object detection results.
This figure as: png eps txt gnuplot



Orientation estimation results.
This figure as: png eps txt gnuplot



3D object detection results.
This figure as: png eps txt gnuplot



Bird's eye view results.
This figure as: png eps txt gnuplot



2D object detection results.
This figure as: png eps txt gnuplot



Orientation estimation results.
This figure as: png eps txt gnuplot



3D object detection results.
This figure as: png eps txt gnuplot



Bird's eye view results.
This figure as: png eps txt gnuplot




eXTReMe Tracker