基于深度学习的高效人体姿态估计算法的研究与实现

吴志君; 程渤

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
同行评议
相关论文
评论

基于深度学习的高效人体姿态估计算法的研究与实现

首发时间：2024-03-20

吴志君 ¹
吴志君（1999-），男，硕士研究生，主要研究方向：大数据智能处理
程渤 ¹
程渤（1978-），男，教授、博士生导师，主要研究方向专注于机器学习&深度学习、自然语言处理、计算机视觉。

1、北京邮电大学网络与交换技术国家重点实验室，北京 100876

摘要：为了解决目前人体姿态估计算法模型由于参数量巨大，所耗费的算力资源庞大而难以在工业实践中落地的问题，本文提出了一种轻量化人体姿态估计算法模型。该模型以基于坐标分类的方法为估计手段，采用EfficientNetV2作为轻量级神经网络的骨干结构，用于对输入图像进行特征提取。为提升姿态估计的准确性，本文引入了门控注意力单元，以有效挖掘关键点的空间特征。此外，本文提出了一种混合一致性交叉熵，通过将模型的本轮次预测结果与人工标注的数据相结合，作为本轮样本数据的真实概率分布，以降低人工标注数据的误差对模型的负面影响。本文所提出的模型在COCO数据集上获得了71.7 AP的成绩，超过了许多同类轻量化模型。

关键词：计算机科学与技术人体姿态估计深度学习

For information in English, please click here

The research and implementation of an efficient human pose estimation algorithm based on deep learning

Wu Zhijun ¹
吴志君（1999-），男，硕士研究生，主要研究方向：大数据智能处理
Cheng Bo ¹
程渤（1978-），男，教授、博士生导师，主要研究方向专注于机器学习&深度学习、自然语言处理、计算机视觉。

1、State Key Laboratory of Networking and Switching Technology,Beijing University of Post and Telecommunication,Beijing 100876, Beijing, 100876

Abstract：To address the challenge of the current human pose estimation algorithm models being difficult to deploy in industrial practice due to their massive parameter size and the substantial computational resources they consume, this research proposes a lightweight human pose estimation algorithm model. The model employs a coordinate-based classification approach and utilizes EfficientNetV2 as the backbone structure of a lightweight neural network for feature extraction from input images. To enhance pose estimation accuracy, this study introduces gated attention units to effectively explore spatial features of key points. Additionally, a hybrid consistency cross-entropy is proposed, which combines the model\'s current round prediction results with manually annotated data to serve as the true probability distribution of the current round sample data, mitigating the negative impact of manual annotation data errors on the model. The proposed model in this paper achieves a performance of 71.7 AP on the COCO dataset, surpassing many similar lightweight models.

Keywords： Computer Science and Technology Human Pose Estimation Deep Learning

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

吴志君，程渤. 基于深度学习的高效人体姿态估计算法的研究与实现[EB/OL]. 北京：中国科技论文在线 [2024-03-20]. https://www.paper.edu.cn/releasepaper/content/202403-247.

No.****

同行评议

未申请同行评议

全部评论

0/1000

论文编号	202403-247
论文题目	基于深度学习的高效人体姿态估计算法的研究与实现
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.