Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Paving the Way Towards Kinematic Assessment Using Monocular Video: A Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities

Authors: Medrano-Paredes, Mario; Fernández-González, Carmen; Díaz-Pernas, Francisco-Javier; Saoudi, Hichem; González-Alonso, Javier; Martínez-Zarzuela, Mario;

Paving the Way Towards Kinematic Assessment Using Monocular Video: A Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities

Abstract

Advances in machine learning and wearable sensors offer new opportunities for capturing and analyzing human movement outside specialized laboratories. Accurately tracking and evaluating human movement under real-world conditions is essential for telemedicine, sports science, and rehabilitation. This work introduces a comprehensive benchmark comparing deep learning monocular video-based human pose estimation models with inertial measurement unit (IMU)-driven methods, leveraging VIDIMU dataset containing a total of 13 clinically relevant activities which were captured using both commodity video cameras and 5 IMUs. Joint angles derived from state-of-the-art deep learning frameworks (MotionAGFormer, MotionBERT, MMPose 2D-to-3D pose lifting, and NVIDIA BodyTrack included in Maxine-AR-SDK) were evaluated against joint angles computed from IMU data using OpenSim inverse kinematic methods. A graphical comparison of the angles estimated by each model shows the overall performance for each activity. The results, which also contains the evaluation of multiple metrics (RMSE, NMRSE, MAE, correlation and coefficient of determination) in table and plot format, highlight key trade-offs between video- and sensor-based approaches including costs, accessibility and precision across different daily life activities. This work establishes valuable guidelines for researchers and clinicians seeking to develop robust, cost-effective, and user-friendly solutions for telehealth and remote patient monitoring solutions, ultimately bridging the gap between AI-driven motion capture and accessible healthcare applications.

Related Organizations
Keywords

Human Pose Estimation, Deep Learning, Kinematic Assessment, Computer Vision, Biomechanics, IMU

  • BIP!
    Impact byBIP!
    citations
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
citations
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average