TY - GEN
T1 - Comparative Analysis of state-of-art pre-trained Human Pose Estimation models in underwater condition
AU - Rivera, Mauricio
AU - Huamanchahua, Deyby
AU - Flores, Christian
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Human Pose Estimation (HPE) is essential in computer vision, with applications in sports, assisted living, and gaming. Pre-trained HPE state-of-the-art models like OpenPose, MoveNet and MediaPipe have strengths and weaknesses that have yet to be discovered (advantages and disadvantages are not completely known yet). While it is true that applying HPE underwater to predict swimmers' movements is not a very explored area due to the difficulties this environment represents, it could bring numerous benefits, especially in the fields of sport and human body correction. The study proposes an analysis of pre-trained models (all layers frozen) performance by evaluating their accuracy predictions over 3 videos of professional swimmers doing a dolphin kick underwater using 3 different processes to see which one fits better to each model: without pre or post-processing (Type 1), with pre-processing (Type 2) and with pre and post-processing (Type 3). Results concluded that MediaPipe with Type 3 processing and a confidence of 50% was the most effective for underwater HPE, while OpenPose and MoveNet did not perform well in these conditions.
AB - Human Pose Estimation (HPE) is essential in computer vision, with applications in sports, assisted living, and gaming. Pre-trained HPE state-of-the-art models like OpenPose, MoveNet and MediaPipe have strengths and weaknesses that have yet to be discovered (advantages and disadvantages are not completely known yet). While it is true that applying HPE underwater to predict swimmers' movements is not a very explored area due to the difficulties this environment represents, it could bring numerous benefits, especially in the fields of sport and human body correction. The study proposes an analysis of pre-trained models (all layers frozen) performance by evaluating their accuracy predictions over 3 videos of professional swimmers doing a dolphin kick underwater using 3 different processes to see which one fits better to each model: without pre or post-processing (Type 1), with pre-processing (Type 2) and with pre and post-processing (Type 3). Results concluded that MediaPipe with Type 3 processing and a confidence of 50% was the most effective for underwater HPE, while OpenPose and MoveNet did not perform well in these conditions.
KW - Computer Vision
KW - dolphin kick
KW - Human Pose Estimation
KW - pre-trained model
KW - underwater movement analysis
UR - http://www.scopus.com/inward/record.url?scp=85208828689&partnerID=8YFLogxK
U2 - 10.1109/COLCOM62950.2024.10720259
DO - 10.1109/COLCOM62950.2024.10720259
M3 - Conference contribution
AN - SCOPUS:85208828689
T3 - 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 - Proceedings
BT - 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024 - Proceedings
A2 - Briceno Rodriguez, Diana Z.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE Colombian Conference on Communications and Computing, COLCOM 2024
Y2 - 21 August 2024 through 24 August 2024
ER -