Noise-Aware Video Saliency Prediction

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jan 2021Embargo end date: 01 Jan 2021Publisher:British Machine Vision AssociationJournal:Proceedings of the British Machine Vision Conference 2021Funded by:NSF | Materials Research Scienc..., NSF | CC* Compute: A high-perf...

Authors: Ekta Prashnani; Orazio Gallo; Joohwan Kim; Josef B. Spjut; Pradeep Sen; Iuri Frosio;

doi: 10.5244/c.35.305 , 10.48550/arxiv.2104.08038

arXiv: 2104.08038

Noise-Aware Video Saliency Prediction

- Summary
- Subjects
- Metrics

Abstract

We tackle the problem of predicting saliency maps for videos of dynamic scenes. We note that the accuracy of the maps reconstructed from the gaze data of a fixed number of observers varies with the frame, as it depends on the content of the scene. This issue is particularly pressing when a limited number of observers are available. In such cases, directly minimizing the discrepancy between the predicted and measured saliency maps, as traditional deep-learning methods do, results in overfitting to the noisy data. We propose a noise-aware training (NAT) paradigm that quantifies and accounts for the uncertainty arising from frame-specific gaze data inaccuracy. We show that NAT is especially advantageous when limited training data is available, with experiments across different models, loss functions, and datasets. We also introduce a video game-based saliency dataset, with rich temporal semantics, and multiple gaze attractors per frame. The dataset and source code are available at https://github.com/NVlabs/NAT-saliency.

10 pages, 3 figures, 7 tables

Related Organizations

University of California System
United States
University of California, Santa Barbara
United States
University of California, San Francisco
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Funded by

NSF| Materials Research Science and Engineering Center at UCSB, NSF| CC* Compute: A high-performance GPU cluster for accelerated research