descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Sep 2023Embargo end date: 01 Jan 2022 English Publisher:Elsevier BVJournal:Image and Vision Computing, volume 137, page 104,746 (issn: 0262-8856,

Authors: Ferrante, M; Boccato, T; Spasov, S; Duggento, A; Toschi, N;

doi: 10.1016/j.imavis.2023.104746 , 10.48550/arxiv.2209.12279

arXiv: 2209.12279

handle: 2108/340863

VAESim: A probabilistic approach for self-supervised prototype discovery

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

In medicine, curated image datasets often employ discrete labels to describe what is known to be a continuous spectrum of healthy to pathological conditions, such as e.g. the Alzheimer's Disease Continuum or other areas where the image plays a pivotal point in diagnosis. We propose an architecture for image stratification based on a conditional variational autoencoder. Our framework, VAESim, leverages a continuous latent space to represent the continuum of disorders and finds clusters during training, which can then be used for image/patient stratification. The core of the method learns a set of prototypical vectors, each associated with a cluster. First, we perform a soft assignment of each data sample to the clusters. Then, we reconstruct the sample based on a similarity measure between the sample embedding and the prototypical vectors of the clusters. To update the prototypical embeddings, we use an exponential moving average of the most similar representations between actual prototypes and samples in the batch size. We test our approach on the MNIST-handwritten digit dataset and on a medical benchmark dataset called PneumoniaMNIST. We demonstrate that our method outperforms baselines in terms of kNN accuracy measured on a classification task against a standard VAE (up to 15% improvement in performance) in both datasets, and also performs at par with classification models trained in a fully supervised way. We also demonstrate how our model outperforms current, end-to-end models for unsupervised stratification.

Related Organizations

University of Cambridge
United Kingdom
Harvard University
United States
Martinos Center for Biomedical Imaging,
United States
Athinoula A. Martinos Center for Biomedical Imaging
United States
University of Rome Tor Vergata
Italy

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Settore FIS/07, medical imaging, Computer Science - Computer Vision and Pattern Recognition, 006, deep clustering, Settore PHYS-06/A - Fisica per le scienze della vita, 004, Machine Learning (cs.LG), prototypes discovery; variational autoencoders; medical imaging; deep clustering; prototypes discovery; variational autoencoders; medical imaging; deep clustering, prototypes discovery, Signal Processing, variational autoencoder, l'ambiente e i beni culturali, Computer Vision and Pattern Recognition

1 Research products, page 1 of 1

VAESIM software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	5
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%