MDFusion:一种多尺度特征解缠的多模态图像融合算法

王浩博; 张海旸

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

MDFusion:一种多尺度特征解缠的多模态图像融合算法

首发时间：2025-03-26

王浩博 ¹
王浩博（2000-），男，硕士研究生。主要研究方向: 计算机视觉。
张海旸 ¹
张海旸，男，北京邮电大学副教授。主要研究方向：涉及多媒体系统与网络、新型网络路由协议、SDN、云计算、网络设备虚拟化等。

1、北京邮电大学计算机学院，北京　100876

摘要：在计算机视觉研究领域,跨模态图像信息整合技术正受到广泛关注。针对现有算法在特征细节提取与跨域适应性方面存在的局限性,本研究创新性地构建了MDFusion多尺度特征分离架构。该架构的核心在于提出了一种新颖的特征交互解耦机制,通过多维度跨模态信息提取策略,显著降低了特征间的相互干扰,有效提升了图像细节的保持度。在技术实现层面,本研究对UNETR网络架构进行了优化改进,通过引入多通道信息处理机制,增强了模型对源图像多尺度特征的融合能力,从而实现了更全面的图像信息捕获。这种创新性的架构设计不仅提升了模型的特征学习能力,更在跨领域适应性方面取得了突破性进展。实验验证表明,该框架在无需进行参数调整或额外训练的情况下,即可实现跨领域数据集的直接迁移应用,并在多个评估指标上达到了业界领先水平。这一研究成果为计算机视觉领域的跨模态图像处理提供了新的技术思路和解决方案。

关键词：计算机应用技术;多模态图像融合;计算机视觉;深度学习;多尺度特征解缠

For information in English, please click here

MDFusion:A Multi-Scale Feature Disentangling Framework for Multi-Modality Image Fusion

WANG Haobo ¹
王浩博（2000-），男，硕士研究生。主要研究方向: 计算机视觉。
ZHANG Haiyang ¹
张海旸，男，北京邮电大学副教授。主要研究方向：涉及多媒体系统与网络、新型网络路由协议、SDN、云计算、网络设备虚拟化等。

1、School of Comupter Science,Beijing University of Posts and Telecommunications，beijing 100876

Abstract：In the field of computer vision research, cross-modal image information integration technology is attracting widespread attention. To address the limitations of existing algorithms in feature detail extraction and cross-domain adaptability, this study innovatively constructs the MDFusion multi-scale feature separation architecture. The core of this architecture lies in proposing a novel feature interaction decoupling mechanism, which significantly reduces mutual interference between features and effectively enhances image detail preservation through a multi-dimensional cross-modal information extraction strategy. In terms of technical implementation, this study optimizes and improves the UNETR network architecture by introducing a multi-channel information processing mechanism, thereby enhancing the model's ability to fuse multi-scale features of source images and achieving more comprehensive image information capture. This innovative architectural design not only improves the model's feature learning capability but also achieves breakthrough progress in cross-domain adaptability. Experimental validation demonstrates that this framework can achieve direct transfer application to cross-domain datasets without the need for parameter adjustment or additional training, reaching industry-leading levels on multiple evaluation metrics. This research provides new technical insights and solutions for cross-modal image processing in the field of computer vision.

Keywords： Computer application technologyey Multi modal image fusion Computer vision Deep learning Multi scale feature disentangling

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

王浩博，张海旸. MDFusion:一种多尺度特征解缠的多模态图像融合算法[EB/OL]. 北京：中国科技论文在线 [2025-03-26]. https://www.paper.edu.cn/releasepaper/content/202503-247.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	202503-247
论文题目	MDFusion:一种多尺度特征解缠的多模态图像融合算法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.