Efficient Monocular Human Pose Estimation Based on Deep Learning Methods: A Survey

Human pose estimation (HPE) is a crucial computer vision task with a wide range of applications in sports medicine, healthcare, virtual reality, and human-computer interaction. The demand for real-time HPE solutions necessitates the development of efficient deep-learning models that can be deployed on resource-constrained devices. While a few surveys exist in this area, none delve deeply into the critical intersection of efficiency and performance. This survey reviews the state-of-the-art efficient deep learning approaches for real-time HPE, focusing on strategies for improving efficiency without compromising accuracy. We discuss popular backbone networks for HPE, model compression techniques, network pruning and quantization, knowledge distillation, and neural architecture search methods. Furthermore, we critically analyze the existing works, highlighting their strengths, weaknesses, and applicability to different scenarios. We also present an overview of the evaluation datasets, metrics, and design for efficient HPE. Finally, we identify research gaps and challenges in the field, providing insights and recommendations for future research directions in developing efficient and scalable HPE solutions.

Keywords

Survey, 2D human pose estimation, 3D human pose estimation, deep learning, efficiency

Citation

Yan X, Liu B, Qu G. (2024). Efficient Monocular Human Pose Estimation Based on Deep Learning Methods: A Survey. IEEE Access. 12. (pp. 72650-72661).

URI

https://mro.massey.ac.nz/handle/10179/71552

Collections

Journal Articles

Creative Commons license

Except where otherwised noted, this item's license is described as (c) The author/s

Full item page

Efficient Monocular Human Pose Estimation Based on Deep Learning Methods: A Survey

Files

Date

DOI

Open Access Location

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Rights

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By

Creative Commons license