基于Transformer修复网络与多视角一致性校正的文物三维重建方法

唐明轩; 张继威

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
同行评议
相关论文
评论

基于Transformer修复网络与多视角一致性校正的文物三维重建方法

首发时间：2026-03-05

唐明轩 ¹
唐明轩（2001-），男，硕士研究生，主要研究方向：文物数字化、计算机视觉
张继威 ¹
张继威（1989-），男，硕导，主要研究方向：多媒体安全、计算机视觉

1、北京邮电大学计算机学院（国家示范性软件学院）,北京,100876

摘要：三维重建是文物数字化保护与虚拟展示的关键技术。然而,当文物表面存在破损时,由此采集的缺损图像将导致重建模型出现几何失真与纹理空洞,而对各视角独立修复所产生的纹理不一致则进一步加剧渲染伪影。为解决上述难题,本文提出一种将图像修复技术与多视角一致性校正相结合的文物三维重建方案。在修复阶段,构建了一种基于Transformer的文物图像修复网络,该网络采用交替注意力融合Transformer模块在通道维度与空间维度之间进行交互式特征建模,并通过序列注意力整合跳跃连接机制缓解多级下采样过程中高频细节的衰减问题。在一致性校正阶段,设计了一种依据几何约束的多视角纹理统一策略,借助锚点视角筛选与深度引导的跨视角纹理映射保障修复结果在不同视角间的外观统一。最终,将校正后的多视角图像输入COLMAP完成位姿求解,并经由3D Gaussian Splatting生成高保真三维模型。实验结果显示,所提修复网络在文物数据集上的PSNR相比次优方法平均高出0.81 dB,一致性校正使跨视角几何一致性指标TSED获得8.3\%的相对提升,重建渲染PSNR增益达0.36 dB,充分证明了该方法在破损文物高质量三维数字复原中的有效性。

关键词：计算机应用技术文物数字化图像修复三维重建多视角一致性 transformer

For information in English, please click here

A Cultural Relic 3D Reconstruction Method Based on Transformer-Based Inpainting Network and Multi-View Consistency Correction

TANG Mingxuan ¹
唐明轩（2001-），男，硕士研究生，主要研究方向：文物数字化、计算机视觉
ZHANG Jiwei ¹
张继威（1989-），男，硕导，主要研究方向：多媒体安全、计算机视觉

1、School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing 100876

Abstract：3D reconstruction is a key technique for the digital preservation and virtual exhibition of cultural relics. However, when relic surfaces are damaged, the resulting defective images lead to geometric distortions and texture voids in the reconstructed models, while independently inpainting each view introduces texture inconsistencies that further aggravate rendering artifacts. To tackle these challenges, this paper presents a cultural relic 3D reconstruction framework that integrates image inpainting with multi-view consistency correction. In the inpainting stage, a Transformer-based cultural relic image inpainting network is constructed, which leverages an alternating attention fusion Transformer module for interactive feature modeling between channel and spatial dimensions, and employs a sequential attention integration skip connection to mitigate the attenuation of high-frequency details during multi-level downsampling. In the consistency correction stage, a geometry-constrained multi-view texture unification strategy is designed, utilizing anchor view selection and depth-guided cross-view texture mapping to ensure appearance uniformity of inpainted results across different viewpoints. Finally, the corrected multi-view images are fed into COLMAP for pose estimation, followed by high-fidelity 3D modeling via 3D Gaussian Splatting. Experimental results show that the proposed inpainting network achieves an average PSNR gain of 0.81 dB over the runner-up method on the cultural relic dataset, while the consistency correction yields an 8.3\% relative improvement in the cross-view geometric consistency metric TSED and a 0.36 dB increase in reconstruction rendering PSNR, fully demonstrating the effectiveness of the proposed method for high-quality 3D digital restoration of damaged cultural relics.

Keywords： computer application technology cultural heritage digitization image inpainting 3D reconstruction multi-view consistency transformer

基金：

1. 国家重点研发计划项目（2024YFF0907404）

论文图表：

引用

导出参考文献

.txt

.ris

.doc

唐明轩，张继威. 基于Transformer修复网络与多视角一致性校正的文物三维重建方法[EB/OL]. 北京：中国科技论文在线 [2026-03-05]. https://www.paper.edu.cn/releasepaper/content/202603-62.

No.****

同行评议

未申请同行评议

全部评论

0/1000

论文编号	202603-62
论文题目	基于Transformer修复网络与多视角一致性校正的文物三维重建方法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.