Disparity Map Neural Operator

Li Danyang; Jiang Yanjun


First published: 2024-02-05

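Li Danyang ¹
Li Danyang (b. 1999), female. Main research interest: computer vision.
Jiang Yanjun ¹
Jiang Yanjun (b. 1967), male, associate professor and master's supervisor. Main research interest: computer networks and their applications.

1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876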

Abstract: Training deep neural networks is a common approach to stereo vision tasks. However, two issues commonly arise when the trained models are applied to real-world scenes. First, a fixed image scale cannot meet the demands of high-resolution image prediction. Second, because of limited computational resources, existing deep neural networks for disparity estimation can only construct the Cost Volume with a fixed step size within a predefined disparity range, which limits their adaptability to scenes requiring different depth perception. These two issues constrain the applicability and generalizability of the models. To address them, this paper proposes the Disparity Map Neural Operator (DispNO). DispNO learns a mapping from the space of stereo image functions to the space of disparity functions, enabling the generation of disparity maps at arbitrary scales. More importantly, DispNO adapts to stereo vision tasks with varying depth perception requirements. Experiments show that DispNO offers stronger applicability and performance than state-of-the-art methods, providing an effective solution to the problems of spatial resolution and depth perception adaptability in stereo vision tasks.

Keywords: computer vision; disparity estimation; neural operator; multi-scale learning; stereo matching
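
The fixed-step Cost Volume limitation mentioned in the abstract can be illustrated with a minimal sketch. This is not the DispNO method and not the authors' code: the function name `build_cost_volume`, the correlation cost, and the one-hot toy features are all assumptions chosen for illustration. The sketch samples candidate disparities 0, step, 2*step, ... below `max_disp` and shows that a true disparity lying between samples cannot be recovered.

```python
import numpy as np

def build_cost_volume(left_feat, right_feat, max_disp, step=1):
    """Correlation cost volume over disparities 0, step, 2*step, ... < max_disp.

    left_feat, right_feat: arrays of shape (C, H, W) (hypothetical features).
    Returns (volume, disparities); volume has shape (D, H, W).
    """
    channels, height, width = left_feat.shape
    disparities = np.arange(0, max_disp, step)
    volume = np.zeros((len(disparities), height, width), dtype=np.float64)
    for i, d in enumerate(disparities):
        d = int(d)
        # A left pixel at column x is compared with the right pixel at x - d.
        if d == 0:
            volume[i] = (left_feat * right_feat).mean(axis=0)
        elif d < width:
            volume[i, :, d:] = (left_feat[:, :, d:] * right_feat[:, :, :-d]).mean(axis=0)
    return volume, disparities

# Toy data (assumption): every right-image column has a unique one-hot code,
# and the left image is the right image shifted by a true disparity of 3.
width, height = 12, 2
right = np.zeros((width, height, width))
for x in range(width):
    right[x, :, x] = 1.0
left = np.roll(right, 3, axis=2)

# Step 1 samples disparity 3, so the best match is found.
vol1, disp1 = build_cost_volume(left, right, max_disp=8, step=1)
best1 = disp1[vol1.argmax(axis=0)]

# Step 2 samples only 0, 2, 4, 6, so disparity 3 is never evaluated.
vol2, disp2 = build_cost_volume(left, right, max_disp=8, step=2)
```

In this toy setting, `best1` equals 3 at every column from 3 onward, while `vol2` contains no matching response there at all. Memory grows with the number of sampled disparities, which is why the range and step are usually fixed in advance.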


Citation


Li Danyang, Jiang Yanjun. Disparity Map Neural Operator [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2024-02-05]. https://www.paper.edu.cn/releasepaper/content/202402-33.


Paper No.	202402-33