descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 26 Feb 2025Embargo end date: 01 Jan 2024Publisher:IEEEJournal:2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Authors: Pucci, R. (Rita); N. Martinel (Niki);

doi: 10.1109/wacv61041.2025.00212 , 10.48550/arxiv.2406.01294

arXiv: 2406.01294

CE-VAE: Capsule Enhanced Variational AutoEncoder for Underwater Image Enhancement

- Summary
- Subjects
- Metrics

Abstract

Unmanned underwater image analysis for marine monitoring faces two key challenges: (i) degraded image quality due to light attenuation and (ii) hardware storage constraints limiting high-resolution image collection. Existing methods primarily address image enhancement with approaches that hinge on storing the full-size input. In contrast, we introduce the Capsule Enhanced Variational AutoEncoder (CE-VAE), a novel architecture designed to efficiently compress and enhance degraded underwater images. Our attention-aware image encoder can project the input image onto a latent space representation while being able to run online on a remote device. The only information that needs to be stored on the device or sent to a beacon is a compressed representation. There is a dual-decoder module that performs offline, full-size enhanced image generation. One branch reconstructs spatial details from the compressed latent space, while the second branch utilizes a capsule-clustering layer to capture entity-level structures and complex spatial relationships. This parallel decoding strategy enables the model to balance fine-detail preservation with context-aware enhancements. CE-VAE achieves state-of-the-art performance in underwater image enhancement on six benchmark datasets, providing up to 3x higher compression efficiency than existing approaches. Code available at \url{https://github.com/iN1k1/ce-vae-underwater-image-enhancement}.

Accepted for publication at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Related Organizations

University of Udine
Italy
Naturalis Biodiversity Center
Netherlands

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Image and Video Processing (eess.IV), Computer Science - Computer Vision and Pattern Recognition, FOS: Electrical engineering, electronic engineering, information engineering, Electrical Engineering and Systems Science - Image and Video Processing

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Beta

SDGs Suggest

14. Life underwater

Beta

SDGs:

14. Life underwater,

Related to Research communities

Knowmad Institut

Netherlands Research Portal