基于Transformer修复网络与多视角一致性校正的文物三维重建方法
首发时间:2026-03-05
摘要:三维重建是文物数字化保护与虚拟展示的关键技术。然而,当文物表面存在破损时,由此采集的缺损图像将导致重建模型出现几何失真与纹理空洞,而对各视角独立修复所产生的纹理不一致则进一步加剧渲染伪影。为解决上述难题,本文提出一种将图像修复技术与多视角一致性校正相结合的文物三维重建方案。在修复阶段,构建了一种基于Transformer的文物图像修复网络,该网络采用交替注意力融合Transformer模块在通道维度与空间维度之间进行交互式特征建模,并通过序列注意力整合跳跃连接机制缓解多级下采样过程中高频细节的衰减问题。在一致性校正阶段,设计了一种依据几何约束的多视角纹理统一策略,借助锚点视角筛选与深度引导的跨视角纹理映射保障修复结果在不同视角间的外观统一。最终,将校正后的多视角图像输入COLMAP完成位姿求解,并经由3D Gaussian Splatting生成高保真三维模型。实验结果显示,所提修复网络在文物数据集上的PSNR相比次优方法平均高出0.81 dB,一致性校正使跨视角几何一致性指标TSED获得8.3\%的相对提升,重建渲染PSNR增益达0.36 dB,充分证明了该方法在破损文物高质量三维数字复原中的有效性。
关键词: 计算机应用技术 文物数字化 图像修复 三维重建 多视角一致性 transformer
For information in English, please click here
A Cultural Relic 3D Reconstruction Method Based on Transformer-Based Inpainting Network and Multi-View Consistency Correction
Abstract:3D reconstruction is a key technique for the digital preservation and virtual exhibition of cultural relics. However, when relic surfaces are damaged, the resulting defective images lead to geometric distortions and texture voids in the reconstructed models, while independently inpainting each view introduces texture inconsistencies that further aggravate rendering artifacts. To tackle these challenges, this paper presents a cultural relic 3D reconstruction framework that integrates image inpainting with multi-view consistency correction. In the inpainting stage, a Transformer-based cultural relic image inpainting network is constructed, which leverages an alternating attention fusion Transformer module for interactive feature modeling between channel and spatial dimensions, and employs a sequential attention integration skip connection to mitigate the attenuation of high-frequency details during multi-level downsampling. In the consistency correction stage, a geometry-constrained multi-view texture unification strategy is designed, utilizing anchor view selection and depth-guided cross-view texture mapping to ensure appearance uniformity of inpainted results across different viewpoints. Finally, the corrected multi-view images are fed into COLMAP for pose estimation, followed by high-fidelity 3D modeling via 3D Gaussian Splatting. Experimental results show that the proposed inpainting network achieves an average PSNR gain of 0.81 dB over the runner-up method on the cultural relic dataset, while the consistency correction yields an 8.3\% relative improvement in the cross-view geometric consistency metric TSED and a 0.36 dB increase in reconstruction rendering PSNR, fully demonstrating the effectiveness of the proposed method for high-quality 3D digital restoration of damaged cultural relics.
Keywords: computer application technology cultural heritage digitization image inpainting 3D reconstruction multi-view consistency transformer
引用

No.****
同行评议
勘误表
基于Transformer修复网络与多视角一致性校正的文物三维重建方法
评论
全部评论