Efficient Monocular Human Pose Estimation Based on Deep Learning Methods: A Survey

Loading...
Thumbnail Image

Date

2024-05-09

DOI

Open Access Location

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Rights

(c) The author/s
CC BY-NC-ND

Abstract

Human pose estimation (HPE) is a crucial computer vision task with a wide range of applications in sports medicine, healthcare, virtual reality, and human-computer interaction. The demand for real-time HPE solutions necessitates the development of efficient deep-learning models that can be deployed on resource-constrained devices. While a few surveys exist in this area, none delve deeply into the critical intersection of efficiency and performance. This survey reviews the state-of-the-art efficient deep learning approaches for real-time HPE, focusing on strategies for improving efficiency without compromising accuracy. We discuss popular backbone networks for HPE, model compression techniques, network pruning and quantization, knowledge distillation, and neural architecture search methods. Furthermore, we critically analyze the existing works, highlighting their strengths, weaknesses, and applicability to different scenarios. We also present an overview of the evaluation datasets, metrics, and design for efficient HPE. Finally, we identify research gaps and challenges in the field, providing insights and recommendations for future research directions in developing efficient and scalable HPE solutions.

Description

Keywords

Survey, 2D human pose estimation, 3D human pose estimation, deep learning, efficiency

Citation

Yan X, Liu B, Qu G. (2024). Efficient Monocular Human Pose Estimation Based on Deep Learning Methods: A Survey. IEEE Access. 12. (pp. 72650-72661).

Collections

Endorsement

Review

Supplemented By

Referenced By

Creative Commons license

Except where otherwised noted, this item's license is described as (c) The author/s