Sparse to Dense Dynamic 3D Facial Expression Generation

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Preprint 01 Jun 2022Embargo end date: 01 Jan 2021 France, Italy, France Publisher:IEEEJournal:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Funded by:ANR | ULNE, EC | AI4Media, ANR | Human4D

Authors: Otberdout, Naima; Ferrari, Claudio; Daoudi, Mohamed; Berretti, Stefano; Del Bimbo, Alberto;

doi: 10.1109/cvpr52688.2022.01974 , 10.5281/zenodo.6396130 , 10.48550/arxiv.2105.07463 , 10.5281/zenodo.6396131

arXiv: 2105.07463

handle: 2158/1453054

Sparse to Dense Dynamic 3D Facial Expression Generation

- Summary
- Subjects
- Metrics

Abstract

In this paper, we propose a solution to the task of generating dynamic 3D facial expressions from a neutral 3D face and an expression label. This involves solving two sub-problems: (i) modeling the temporal dynamics of expressions, and (ii) deforming the neutral mesh to obtain the expressive counterpart. We represent the temporal evolution of expressions using the motion of a sparse set of 3D landmarks that we learn to generate by training a manifold-valued GAN (Motion3DGAN). To better encode the expression-induced deformation and disentangle it from the identity information, the generated motion is represented as per-frame displacement from a neutral configuration. To generate the expressive meshes, we train a Sparse2Dense mesh Decoder (S2D-Dec) that maps the landmark displacements to a dense, per-vertex displacement. This allows us to learn how the motion of a sparse set of landmarks influences the deformation of the overall face surface, independently from the identity. Experimental results on the CoMA and D3DFACS datasets show that our solution brings significant improvements with respect to previous solutions in terms of both dynamic expression generation and mesh reconstruction, while retaining good generalization to unseen data. The code and the pretrained model will be made publicly available

Countries

France, Italy, France

Related Organizations

Centrale Lille Institut
France
University of Parma
Italy
University of Lille
France
Fondation I-SITE Université Lille Nord-Europe
France
Université de Lille III (Charles-de-Gaulle)
France

View all View all

Keywords

FOS: Computer and information sciences, 3D from multi-view and sensors; Face and gestures; Image and video synthesis and generation, [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	18
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%