VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Name: VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space
Keywords: FOS: Computer and information sciences, Classificació INSPEC::Pattern recognition::Computer vision, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Human mesh recovery, Human pose and shape estimation, Transformers, Computer Vision and Pattern Recognition (cs.CV), Àrees temàtiques de la UPC::Informàtica::Automàtica i control, Computer Science - Computer Vision and Pattern Recognition, Vector quantized autoencoder

Fiche, Guénolé; Leglaive, Simon; Alameda-Pineda, Xavier; Agudo, Antonio; Moreno-Noguer, Francesc

Found an issue? Give us feedback

downloadFull-Text

UPCommons. Portal de...arrow_drop_down

UPCommons. Portal del coneixement obert de la UPC

Part of book or chapter of book . 2024 . Peer-reviewed

Full-Text: https://upcommons.upc.edu/bitstreams/f07246cb-e80b-44e5-9c1e-8385891bdd47/download

Data sources: UPCommons. Portal del coneixement obert de la UPC

arXiv.org e-Print Archive

Preprint . 2023

Data sources: arXiv.org e-Print Archive

Recolector de Ciencia Abierta, RECOLECTA

Part of book or chapter of book . 2024 . Peer-reviewed

Data sources: Recolector de Ciencia Abierta, RECOLECTA

https://doi.org/10.1007/978-3-...

Part of book or chapter of book . 2024 . Peer-reviewed

License: Springer Nature TDM

Data sources: Crossref

Recolector de Ciencia Abierta, RECOLECTA

Article . 2025 . Peer-reviewed

Data sources: Recolector de Ciencia Abierta, RECOLECTA

INRIA2

Conference object . 2024

License: CC BY

Data sources: INRIA2

DIGITAL.CSIC

Article . 2025 . Peer-reviewed

Data sources: DIGITAL.CSIC

HAL-Rennes 1

Conference object . 2024

License: CC BY

Data sources: HAL-Rennes 1

INRIA a CCSD electronic archive server

Conference object . 2024

License: CC BY

Data sources: INRIA a CCSD electronic archive server

UPCommons

Part of book or chapter of book . 2024

Data sources: Bielefeld Academic Search Engine (BASE)

https://dx.doi.org/10.48550/ar...

Article . 2023

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

DBLP

Conference object

Data sources: DBLP

DBLP

Article

Data sources: DBLP

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Article , Preprint , Conference object 29 Nov 2024Embargo end date: 01 Jan 2023 Spain, France, Spain, Spain English Publisher:Springer Nature Switzerland

Authors: Fiche, Guénolé; Leglaive, Simon; Alameda-Pineda, Xavier; Agudo, Antonio; Moreno-Noguer, Francesc;

doi: 10.1007/978-3-031-72943-0_27 , 10.48550/arxiv.2312.08291

arXiv: 2312.08291

handle: 2117/425811 , 10261/388303

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

- Summary
- Subjects
- Metrics

Abstract

Previous works on Human Pose and Shape Estimation (HPSE) from RGB images can be broadly categorized into two main groups: parametric and non-parametric approaches. Parametric techniques leverage a low-dimensional statistical body model for realistic results, whereas recent non-parametric methods achieve higher precision by directly regressing the 3D coordinates of the human body mesh. This work introduces a novel paradigm to address the HPSE problem, involving a low-dimensional discrete latent representation of the human mesh and framing HPSE as a classification task. Instead of predicting body model parameters or 3D vertex coordinates, we focus on predicting the proposed discrete latent representation, which can be decoded into a registered human mesh. This innovative paradigm offers two key advantages. Firstly, predicting a low-dimensional discrete representation confines our predictions to the space of anthropomorphic poses and shapes even when little training data is available. Secondly, by framing the problem as a classification task, we can harness the discriminative power inherent in neural networks. The proposed model, VQ-HPS, predicts the discrete latent representation of the mesh. The experimental results demonstrate that VQ-HPS outperforms the current state-of-the-art non-parametric approaches while yielding results as realistic as those produced by parametric methods when trained with little data. VQ-HPS also shows promising results when training on large-scale datasets, highlighting the significant potential of the classification approach for HPSE. See the project page at https://g-fiche.github.io/research-pages/vqhps/

Countries

Spain, France, Spain, Spain

Related Organizations

Spanish National Research Council
Spain
Universitat Politècnica de Catalunya
Spain
Grenoble Alpes University
France
French Institute for Research in Computer Science and Automation
France
University of Rennes 1
France

View all View all

Keywords

FOS: Computer and information sciences, Classificació INSPEC::Pattern recognition::Computer vision, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Human mesh recovery, Human pose and shape estimation, Transformers, Computer Vision and Pattern Recognition (cs.CV), Àrees temàtiques de la UPC::Informàtica::Automàtica i control, Computer Science - Computer Vision and Pattern Recognition, Vector quantized autoencoder, 004

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

4

Top 10%

Average

Green

Related to Research communities

INRIA