Method

Wavelet-Transform Kolmogorov-Arnold Network for Unsupervised Stereo Matching [WT-kan]
[Anonymous Submission]

Submitted on 26 Feb. 2025 09:25 by
[Anonymous Submission]

Running time:0.12 s
Environment:gpu @ 2.5 Ghz (Python)

Method Description:
Stereo image pairs encode 3D scene information through the correspondences between the left and right images. Traditional CNN-based methods typically rely on cost volume techniques to capture stereo correspondences over large disparities but suffer from large parameter sizes and slow computation speeds. Although existing research has optimized the modeling of stereo correspondences, the overall network architecture still needs improvement. To address these challenges, inspired by the significant achievements of Kolmogorov-Arnold Networks (KANs) in terms of accuracy and interpretability, we propose a new unsupervised stereo matching network, WT-KAN. This network reshapes the traditional U-Net architecture by introducing KAN layers, combining nonlinear learnable activation functions, wavelet transforms, and axial attention mechanisms. It enhances the network's ability to capture details and shapes and establishes long-range dependencies. WT-KAN can more efficiently learn complex stereo c
Parameters:
0.2
Latex Bibtex:
N/A

Detailed Results

This page provides detailed results for the method(s) selected. For the first 20 test images, the percentage of erroneous pixels is depicted in the table. We use the error metric described in Object Scene Flow for Autonomous Vehicles (CVPR 2015), which considers a pixel to be correctly estimated if the disparity or flow end-point error is <3px or <5% (for scene flow this criterion needs to be fulfilled for both disparity maps and the flow map). Underneath, the left input image, the estimated results and the error maps are shown (for disp_0/disp_1/flow/scene_flow, respectively). The error map uses the log-color scale described in Object Scene Flow for Autonomous Vehicles (CVPR 2015), depicting correct estimates (<3px or <5% error) in blue and wrong estimates in red color tones. Dark regions in the error images denote the occluded pixels which fall outside the image boundaries. The false color maps of the results are scaled to the largest ground truth disparity values / flow magnitudes.

Test Set Average

Error D1-bg D1-fg D1-all
All / All 6.53 15.34 8.00
All / Est 6.53 15.34 8.00
Noc / All 6.14 13.83 7.41
Noc / Est 6.14 13.83 7.41
This table as LaTeX

Test Image 0

Error D1-bg D1-fg D1-all
All / All 10.53 4.36 9.68
All / Est 10.53 4.36 9.68
Noc / All 10.53 4.36 9.67
Noc / Est 10.53 4.36 9.67
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 1

Error D1-bg D1-fg D1-all
All / All 11.29 2.59 10.32
All / Est 11.29 2.59 10.32
Noc / All 11.12 2.59 10.15
Noc / Est 11.12 2.59 10.15
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 2

Error D1-bg D1-fg D1-all
All / All 9.08 19.70 9.59
All / Est 9.08 19.70 9.59
Noc / All 8.46 19.70 9.02
Noc / Est 8.46 19.70 9.02
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 3

Error D1-bg D1-fg D1-all
All / All 7.46 7.08 7.42
All / Est 7.46 7.08 7.42
Noc / All 7.26 7.08 7.24
Noc / Est 7.26 7.08 7.24
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 4

Error D1-bg D1-fg D1-all
All / All 8.52 6.23 8.14
All / Est 8.52 6.23 8.14
Noc / All 7.67 6.23 7.43
Noc / Est 7.67 6.23 7.43
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 5

Error D1-bg D1-fg D1-all
All / All 17.55 9.15 16.79
All / Est 17.55 9.15 16.79
Noc / All 16.79 9.15 16.09
Noc / Est 16.79 9.15 16.09
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 6

Error D1-bg D1-fg D1-all
All / All 17.92 2.01 16.24
All / Est 17.92 2.01 16.24
Noc / All 17.56 2.01 15.89
Noc / Est 17.56 2.01 15.89
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 7

Error D1-bg D1-fg D1-all
All / All 3.57 18.85 6.56
All / Est 3.57 18.85 6.56
Noc / All 3.63 18.85 6.65
Noc / Est 3.63 18.85 6.65
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 8

Error D1-bg D1-fg D1-all
All / All 2.73 1.80 2.56
All / Est 2.73 1.80 2.56
Noc / All 2.69 1.80 2.53
Noc / Est 2.69 1.80 2.53
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 9

Error D1-bg D1-fg D1-all
All / All 4.06 3.60 3.94
All / Est 4.06 3.60 3.94
Noc / All 4.12 2.73 3.77
Noc / Est 4.12 2.73 3.77
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 10

Error D1-bg D1-fg D1-all
All / All 2.35 4.73 2.90
All / Est 2.35 4.73 2.90
Noc / All 2.34 4.73 2.89
Noc / Est 2.34 4.73 2.89
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 11

Error D1-bg D1-fg D1-all
All / All 3.63 4.23 3.74
All / Est 3.63 4.23 3.74
Noc / All 3.65 4.23 3.75
Noc / Est 3.65 4.23 3.75
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 12

Error D1-bg D1-fg D1-all
All / All 1.40 0.62 1.34
All / Est 1.40 0.62 1.34
Noc / All 1.38 0.62 1.33
Noc / Est 1.38 0.62 1.33
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 13

Error D1-bg D1-fg D1-all
All / All 1.77 3.10 1.93
All / Est 1.77 3.10 1.93
Noc / All 1.52 3.10 1.72
Noc / Est 1.52 3.10 1.72
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 14

Error D1-bg D1-fg D1-all
All / All 2.86 0.51 2.82
All / Est 2.86 0.51 2.82
Noc / All 2.74 0.51 2.70
Noc / Est 2.74 0.51 2.70
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 15

Error D1-bg D1-fg D1-all
All / All 6.30 0.23 5.75
All / Est 6.30 0.23 5.75
Noc / All 6.24 0.23 5.69
Noc / Est 6.24 0.23 5.69
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 16

Error D1-bg D1-fg D1-all
All / All 7.05 0.59 6.10
All / Est 7.05 0.59 6.10
Noc / All 6.72 0.59 5.81
Noc / Est 6.72 0.59 5.81
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 17

Error D1-bg D1-fg D1-all
All / All 1.52 0.71 1.43
All / Est 1.52 0.71 1.43
Noc / All 1.41 0.71 1.33
Noc / Est 1.41 0.71 1.33
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 18

Error D1-bg D1-fg D1-all
All / All 10.71 8.78 9.79
All / Est 10.71 8.78 9.79
Noc / All 10.31 8.78 9.58
Noc / Est 10.31 8.78 9.58
This table as LaTeX

Input Image

D1 Result

D1 Error


Test Image 19

Error D1-bg D1-fg D1-all
All / All 2.56 1.01 2.38
All / Est 2.56 1.01 2.38
Noc / All 2.50 1.01 2.33
Noc / Est 2.50 1.01 2.33
This table as LaTeX

Input Image

D1 Result

D1 Error




eXTReMe Tracker