Learning conditional variational autoencoders with missing covariates

Publication: Article, Preprint; 01 Mar 2024; Embargo end date: 01 Jan 2022; Finland; English

Publisher: Elsevier BV

Journal: Pattern Recognition, volume 147, page 110113 (ISSN: 0031-3203)

Funded by: AKA | Bridging the Reality Gap ...

Authors: Tikhonov, Gleb; Lönnroth, Otto; Tiikkainen, Pekka; Lähdesmäki, Harri; Ramchandran, Siddharth

doi: 10.1016/j.patcog.2023.110113, 10.48550/arxiv.2203.01218

arXiv: 2203.01218

handle: 10138/569684

Abstract

Conditional variational autoencoders (CVAEs) are versatile deep generative models that extend the standard VAE framework by conditioning the generative model with auxiliary covariates. The original CVAE model assumes that the data samples are independent, whereas more recent conditional VAE models, such as the Gaussian process (GP) prior VAEs, can account for complex correlation structures across all data samples. While several methods have been proposed to learn standard VAEs from partially observed datasets, these methods fall short for conditional VAEs. In this work, we propose a method to learn conditional VAEs from datasets in which auxiliary covariates can contain missing values as well. The proposed method augments the conditional VAEs with a prior distribution for the missing covariates and estimates their posterior using amortised variational inference. At training time, our method marginalises the uncertainty associated with the missing covariates while simultaneously maximising the evidence lower bound. We develop computationally efficient methods to learn CVAEs and GP prior VAEs that are compatible with mini-batching. Our experiments on simulated datasets as well as on a clinical trial study show that the proposed method outperforms previous methods in learning conditional VAEs from non-temporal, temporal, and longitudinal datasets.
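
The training objective described in the abstract can be illustrated with a minimal sketch. It is a scalar toy model, not the authors' implementation. Every name, parameter value, and distribution below is an assumption made for illustration: a standard normal prior on the missing covariate, a linear-Gaussian decoder, and fixed linear "encoders" standing in for amortised inference networks. When the covariate is missing, the sketch draws it from a variational posterior, averages the usual conditional ELBO over those draws, and subtracts the KL divergence to the covariate prior.

```python
import math
import random

# Hypothetical toy parameters (not from the paper): scalar y, z and covariate x.
# Conditional prior p(z | x) = N(A*x, 1); decoder p(y | z, x) = N(B*z + C*x, SIGMA^2).
A, B, C, SIGMA = 1.0, 1.0, 0.5, 0.5


def gaussian_kl(m1, s1, m2, s2):
    """KL( N(m1, s1^2) || N(m2, s2^2) ) for scalar Gaussians."""
    return math.log(s2 / s1) + (s1**2 + (m1 - m2) ** 2) / (2 * s2**2) - 0.5


def inner_elbo(y, x):
    """Conditional ELBO over z for a fully specified covariate x (closed form)."""
    m_z, s_z = 0.5 * y + 0.2 * x, 0.7  # toy encoder q(z | y, x), assumed fixed
    resid = y - B * m_z - C * x
    # Expected Gaussian log-likelihood under q(z | y, x) for a linear decoder.
    exp_loglik = (-0.5 * math.log(2 * math.pi * SIGMA**2)
                  - (resid**2 + B**2 * s_z**2) / (2 * SIGMA**2))
    return exp_loglik - gaussian_kl(m_z, s_z, A * x, 1.0)


def elbo(y, x=None, n_samples=2000, rng=None):
    """ELBO for one data point; x=None marks a missing covariate."""
    if x is not None:
        return inner_elbo(y, x)
    rng = rng or random.Random(0)
    m_x, s_x = 0.3 * y, 0.8  # toy encoder q(x_m | y), assumed fixed
    # Reparameterised draws x = m_x + s_x * eps marginalise the missing covariate.
    draws = (m_x + s_x * rng.gauss(0.0, 1.0) for _ in range(n_samples))
    mc = sum(inner_elbo(y, x_s) for x_s in draws) / n_samples
    # Penalise divergence from the assumed covariate prior p(x_m) = N(0, 1).
    return mc - gaussian_kl(m_x, s_x, 0.0, 1.0)
```

Calling `elbo(1.2, x=0.4)` evaluates the bound with an observed covariate, while `elbo(1.2)` treats the covariate as missing. In a real model the encoders and decoder would be neural networks trained by gradient ascent on this quantity over mini-batches; the closed-form inner term here is available only because the toy model is linear-Gaussian.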

Country

Finland

Keywords

ta113, FOS: Computer and information sciences, Computer Science - Machine Learning, Conditional VAEs, Computer and information sciences, Missing value imputation, Machine Learning (stat.ML), Machine Learning (cs.LG), Statistics - Machine Learning, Gaussian process, Variational autoencoders

Related research

GP-prior-VAE_mis-cov software on GitHub (relation: IsRelatedTo)

Impact by BIP!

- Selected citations: 14. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.