视差神经算子
首发时间:2024-02-05
摘要:训练深度神经网络是双目立体视觉任务中的常用方法,将训练好的模型应用到实际场景时,普遍存在两个问题。首先,固定的图像缩放尺度无法满足大分辨率图像预测任务的需求。其次,由于计算资源的限制,现有的视差估计深度神经网络只能在预定义的视差范围内以固定步长计算Cost Volume,这无法适应需要不同深度感知的场景。这两个问题限制了模型的适用性和泛化能力。为了解决上述的两个问题,本文提出了视差神经算子(DispNO)。DispNO学习从立体图像函数空间到视差函数空间的映射,能够支持任意尺度的视差图生成,更重要的是,DispNO可以适应深度感知不同的立体视觉任务。实验证明,相较于先进的方法,DispNO具有更强的适用性和性能,为解决立体视觉任务中的空间分辨率和深度感知适应性问题提供了有效的解决方案。
关键词: 计算机视觉 视差估计 神经算子 多尺度学习 双目立体匹配
For information in English, please click here
Disparity Map Neural Operator
Abstract:Training deep neural network is a common approach in stereo vision tasks. However, when applying the trained models to real-world scenarios, two prevalent issues often arise. Firstly, fixed image scale cannot meet the demands of high-resolution image prediction tasks. Secondly, due to computational constraints, existing depth estimation neural networks can only construct the Cost Volume with a fixed step size within a predefined disparity range, limiting their adaptability to scenarios requiring diverse depth perception. These challenges constrain the applicability and generalizability of the models. To address these issues, this paper proposes the Disparity Map Neural Operator (DispNO). DispNO learns a mapping from the space of stereo image functions to the space of disparity functions, enabling the generation of disparicy maps at arbitrary scales. Importantly, DispNO adapts to varying depth perception requirements in stereo vision tasks. Experimental results demonstrate that DispNO exhibits stronger applicability and performance compared to state-of-the-art methods. It provides an effective solution to the challenges of spatial resolution and depth perception adaptability in stereo vision tasks.
Keywords: computer vision disparity estimation neural operator multi-scale learning stereo matching
基金:
引用

No.****
动态公开评议
共计0人参与
勘误表
视差神经算子
评论
全部评论