Salient object detection in egocentric videos

Article, 13 Mar 2024, English
Publisher: Institution of Engineering and Technology (IET)
Journal: IET Image Processing, volume 18, pages 2028-2037 (issn: 1751-9659, eissn: 1751-9667)

Authors: Hao Zhang; Haoran Liang; Xing Zhao; Jian Liu; Ronghua Liang

doi: 10.1049/ipr2.13080

Abstract

In the realm of video salient object detection (VSOD), the majority of research has traditionally centered on third‐person perspective videos. However, this focus overlooks the unique requirements of certain first‐person tasks, such as autonomous driving or robot vision. To bridge this gap, a novel dataset and a camera‐based VSOD model, CaMSD, both specifically designed for egocentric videos, are introduced. First, the SalEgo dataset, comprising 17,400 fully annotated frames for video salient object detection, is presented. Second, a computational model that incorporates a camera movement module is proposed, designed to emulate the patterns observed when humans view videos. Additionally, a saliency enhancement module based on the Squeeze and Excitation Block is incorporated so that, when saliency switches from one object to another, a single salient object is segmented precisely rather than two objects being segmented simultaneously. Experimental results show that the approach outperforms other state‐of‐the‐art methods in egocentric video salient object detection tasks. The dataset and code are available at https://github.com/hzhang1999/SalEgo.
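For readers unfamiliar with the Squeeze and Excitation Block named in the abstract, the following is a minimal sketch of a generic SE block in plain Python. It is not the authors' saliency enhancement module: the feature-map sizes, the reduction ratio `r`, the random weights, and all function names are assumptions made only for illustration. The block pools each channel to one number (squeeze), passes those numbers through two small fully connected layers to obtain one gate per channel (excitation), and rescales each channel by its gate.

```python
import math
import random


def squeeze(feature_map):
    """Global average pooling: one scalar descriptor per channel."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


def dense(vector, weights, biases):
    """Fully connected layer; `weights` holds one row per output unit."""
    return [
        sum(w * v for w, v in zip(row, vector)) + b
        for row, b in zip(weights, biases)
    ]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def se_block(feature_map, w1, b1, w2, b2):
    """Reweight the channels of a C x H x W feature map (nested lists)."""
    z = squeeze(feature_map)                             # C channel descriptors
    hidden = [max(0.0, h) for h in dense(z, w1, b1)]     # C // r units with ReLU
    gates = [sigmoid(g) for g in dense(hidden, w2, b2)]  # C gates in (0, 1)
    scaled = [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates


def random_matrix(rows, cols, rng):
    """Illustrative random weights; a trained model would learn these."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
C, H, W, r = 4, 3, 3, 2  # hypothetical sizes; r is the reduction ratio
features = [[[rng.random() for _ in range(W)] for _ in range(H)] for _ in range(C)]
w1, b1 = random_matrix(C // r, C, rng), [0.0] * (C // r)
w2, b2 = random_matrix(C, C // r, rng), [0.0] * C

scaled, gates = se_block(features, w1, b1, w2, b2)
print([round(g, 3) for g in gates])
```

Because every gate lies strictly between 0 and 1, the block can only attenuate channels relative to one another. That is the general mechanism by which an SE-style module can emphasise some feature channels over others; how CaMSD applies it to saliency switching is described in the paper itself.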

Related Organizations

Zhejiang University of Technology
China (People's Republic of)

Keywords

QA76.75-76.765, Photography, object detection, Computer software, TR1-1050, image processing

Impact by BIP!

	selected citations	2
	popularity	Top 10%
	influence	Average
	impulse	Average

Open access: gold

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering
