Visual understanding and personalization for an optimal recollection experience

Name: Visual understanding and personalization for an optimal recollection experience
Creator: Ana, Garcia del Molino
Keywords: :Computer science and engineering::Computing methodologies::Image processing and computer vision [Engineering], :Computer science and engineering::Computing methodologies::Pattern recognition [Engineering]

Ana, Garcia del Molino

Found an issue? Give us feedback

https://dr.ntu.edu.s...arrow_drop_down

https://dr.ntu.edu.sg/bitstrea...

Doctoral thesis

Data sources: UnpayWall

Digital Repository of NTU

Thesis . 2019

Data sources: Digital Repository of NTU

https://doi.org/10.32657/10356...

Doctoral thesis . 2020 . Peer-reviewed

Data sources: Crossref

DBLP

Doctoral thesis

Data sources: DBLP

https://dx.doi.org/10.32657/10...

Thesis

Data sources: Microsoft Academic Graph

Visual understanding and personalization for an optimal recollection experience

descriptionPublicationkeyboard_double_arrow_right Doctoral thesis , Thesis 28 Oct 2020 Singapore Publisher:Nanyang Technological University

Authors: Ana, Garcia del Molino;

doi: 10.32657/10356/82932

Visual understanding and personalization for an optimal recollection experience

- Summary
- Subjects
- Metrics

Abstract

The affordability of wearable cameras such as the Narrative Clip and GoPro allows mass-market consumers to continuously record their lives, producing large amounts of unstructured visual data. Moreover, users tend to record with their smartphones more multimedia content than they can possibly share or review. We use each of these devices for different purposes: action cameras for travels and adventures; our smartphones to capture on the spur of the moment; a lifelogging device to record unobtrusively all our daily life activities. As a result, the few important shots end up buried among many repetitive images or uninteresting long segments, requiring hours of manual analysis in order to, say, select highlights in a day or find the most aesthetic pictures. Tackling challenges in end-to-end consumer video summarization, this thesis contributes to the state of the art in three major aspects: (i) Contextual Event Segmentation, an episodic event segmentation method that is able to detect boundaries between heterogeneous events and ignore local occlusions and brief diversions. CES improves the performance of the baselines by over 16% in F-measure, and is competitive with manual annotations. (ii) Personalized Highlight Detection, a highlight detector that is personalized via its inputs. The experimental results show that using the user history substantially improves the prediction accuracy. PHD outperforms the user-agnostic baselines even with only one single person-specific example. (iii) Active Video Summarization, an interactive approach to video exploration that gathers the user’s preferences while creating a video summary. AVS achieves an excellent compromise between usability and quality. The diverse and uniform nature of AVS summaries makes it alsoa valuable tool for browsing someone else’s visual collection. Additionally, this thesis contributes two large-scale datasets for First Person View video analysis, CSumm and R3, and a large-scale dataset for personalized video highlights, PHD2. Doctor of Philosophy

Country

Singapore

Related Organizations

Nanyang Technological University
Singapore

Keywords

:Computer science and engineering::Computing methodologies::Image processing and computer vision [Engineering], :Computer science and engineering::Computing methodologies::Pattern recognition [Engineering]

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

bronze