视频中基于时序空洞卷积的3D人体姿态估计

刘亚欣; 李莉

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
同行评议
相关论文
评论

视频中基于时序空洞卷积的3D人体姿态估计

首发时间：2021-03-24

刘亚欣 ¹
刘亚欣（1995-），女，主要研究方向：计算机视觉、人体姿态估计
李莉 ¹
李莉（1978-），女，副教授，硕导，主要研究方向：人工智能，边缘计算

1、北京邮电大学信息与通信工程学院，北京 100876

摘要：3D人体姿态估计是计算机视觉领域中的热点研究问题，本文主要研究的是基于视频的3D人体姿态估计，为进一步提高识别准确率，本文中提出了一种基于时序卷积的3D人体姿态估计算法。首先，使用先进的目标检测算法和2D人体姿态估计算法检测出视频帧中人体的2D关节位置信息。然后，使用本文中所提出的基于时序卷积的2D-3D姿态提升网络将2D关节位置信息提升至3D空间。与传统的基于视频中单帧图像的3D人体姿态估计算法相比，本文中所提出的模型能够充分利用视频中的时序信息来改善模型识别效果，并解决传统方法中存在的姿态估计结果不连续的问题。通过实验发现本文所提出的模型在Human3.6M数据集上达到了最佳的性能，其姿态估计误差要远低于现有研究中未使用时序信息进行3D人体姿态估计的方法和使用RNN网络对时序信息进行建模的3D人体姿态估计方法。该实验结果验证了本文中所提出的基于时序卷积的3D人体姿态估计算法的有效性和先进性。

关键词：计算机视觉与应用人体姿态估计时序卷积可视化

For information in English, please click here

3D Human Pose Estimation Based on Temporal Dilated Convolution in the Video

LIU Yaxin ¹
刘亚欣（1995-），女，主要研究方向：计算机视觉、人体姿态估计
LI Li ¹
李莉（1978-），女，副教授，硕导，主要研究方向：人工智能，边缘计算

1、School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876

Abstract：3D human pose estimation is a hot topic in computer vision. This paper mainly studies the 3D human pose estimation based on video. In order to further improve the recognition accuracy, a 3D human pose estimation algorithm based on temporal convolution is proposed. Firstly, advanced object detection algorithm and 2D human pose estimation algorithm are used to detect the 2D joint position of human body in video frames. Then, the 2D-3D pose lifting network based on temporal convolution proposed in this paper is used to transform 2D human pose into 3D human pose. Compared with the traditional 3D human pose estimation algorithm based on single frame in video, our proposed model can make full use of the temporal information in video to improve the recognition performance, and solve the problem of discontinuity in traditional methods. Experimental results show that the proposed model achieves the best performance on the Human3.6M dataset, and its estimation error is much lower than the existing methods that do not use temporal information and the methods that use RNN to model temporal information for 3D human pose estimation. The experimental results also verify the effectiveness and progressiveness of the proposed model.

Keywords： Computer Vision and Application Human Pose Estimation Temporal Convolution Visualization

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

刘亚欣，李莉. 视频中基于时序空洞卷积的3D人体姿态估计[EB/OL]. 北京：中国科技论文在线 [2021-03-24]. https://www.paper.edu.cn/releasepaper/content/202103-257.

No.****

同行评议

未申请同行评议

全部评论

0/1000

论文编号	202103-257
论文题目	视频中基于时序空洞卷积的3D人体姿态估计
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.