3D generation on ImageNet

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object , Report 01 Jan 2023Embargo end date: 01 Jan 2023 Saudi Arabia Publisher:arXivJournal:CoRR, volume abs/2303.01416

Authors: Skorokhodov, Ivan; Siarohin, Aliaksandr; Xu, Yinghao; Ren, Jian; Lee, Hsin-Ying; Wonka, Peter; Tulyakov, Sergey;

doi: 10.48550/arxiv.2303.01416

arXiv: 2303.01416

handle: 10754/690129

3D generation on ImageNet

- Summary
- Subjects
- Related research
  (13)
- Metrics

Abstract

Existing 3D-from-2D generators are typically designed for well-curated single-category datasets, where all the objects have (approximately) the same scale, 3D location, and orientation, and the camera always points to the center of the scene. This makes them inapplicable to diverse, in-the-wild datasets of non-alignable scenes rendered from arbitrary camera poses. In this work, we develop a 3D generator with Generic Priors (3DGP): a 3D synthesis framework with more general assumptions about the training data, and show that it scales to very challenging datasets, like ImageNet. Our model is based on three new ideas. First, we incorporate an inaccurate off-the-shelf depth estimator into 3D GAN training via a special depth adaptation module to handle the imprecision. Then, we create a flexible camera model and a regularization strategy for it to learn its distribution parameters during training. Finally, we extend the recent ideas of transferring knowledge from pre-trained classifiers into GANs for patch-wise trained models by employing a simple distillation-based technique on top of the discriminator. It achieves more stable training than the existing methods and speeds up the convergence by at least 40%. We explore our model on four datasets: SDIP Dogs 256x256, SDIP Elephants 256x256, LSUN Horses 256x256, and ImageNet 256x256, and demonstrate that 3DGP outperforms the recent state-of-the-art in terms of both texture and geometry quality. Code and visualizations: https://snap-research.github.io/3dgp.

ICLR 2023 (Oral)

Country

Saudi Arabia

Related Organizations

King Abdullah University of Science and Technology
Saudi Arabia

Keywords

FOS: Computer and information sciences, Computer Science - Graphics, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Graphics (cs.GR)

13 Research products, page 1 of 2

Three-dimensional printing of a tunable graphene-based elastomer for strain sensors with ultrahigh sensitivity
2019IsAmongTopNSimilarDocuments
Joint estimation and correction of motion and geometric distortion in segmented arterial spin labeling
2021IsAmongTopNSimilarDocuments
3D gel-printing—An additive manufacturing method for producing complex shape parts
2016IsAmongTopNSimilarDocuments
Towards motion-insensitive Arterial Spin Labeling perfusion imaging
2022IsAmongTopNSimilarDocuments
Effect of PCL concentration on PCL/CaSiO3 porous composite scaffolds for bone engineering
2020IsAmongTopNSimilarDocuments
Fabrication and properties of TiC-high manganese steel cermet processed by 3D gel printing
2021IsAmongTopNSimilarDocuments
High yield production of 3D graphene powders by thermal chemical vapor deposition and application as highly efficient conductive additive of lithium ion battery electrodes
2021IsAmongTopNSimilarDocuments
PointINet software on GitHub
IsRelatedTo
PyMCubes software on GitHub
IsRelatedTo
trimesh software on GitHub
IsRelatedTo

chevron_left
1
2
chevron_right

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all