Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Nov 2019Embargo end date: 01 Jan 2017Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 41, pages 2,693-2,708 (issn: 0162-8828, eissn: 1939-3539,

Copyright policy )

Authors: Mai Xu; Yuhang Song 0001; Jianyi Wang; Minglang Qiao; Liangyu Huo; Zulin Wang;

doi: 10.1109/tpami.2018.2858783 , 10.48550/arxiv.1710.10755

pmid: 30047871

arXiv: 1710.10755

Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Panoramic video provides immersive and interactive experience by enabling humans to control the field of view (FoV) through head movement (HM). Thus, HM plays a key role in modeling human attention on panoramic video. This paper establishes a database collecting subjects' HM in panoramic video sequences. From this database, we find that the HM data are highly consistent across subjects. Furthermore, we find that deep reinforcement learning (DRL) can be applied to predict HM positions, via maximizing the reward of imitating human HM scanpaths through the agent's actions. Based on our findings, we propose a DRL-based HM prediction (DHP) approach with offline and online versions, called offline-DHP and online-DHP. In offline-DHP, multiple DRL workflows are run to determine potential HM positions at each panoramic frame. Then, a heat map of the potential HM positions, named the HM map, is generated as the output of offline-DHP. In online-DHP, the next HM position of one subject is estimated given the currently observed HM position, which is achieved by developing a DRL algorithm upon the learned offline-DHP model. Finally, the experiments validate that our approach is effective in both offline and online prediction of HM positions for panoramic video, and that the learned offline-DHP model can improve the performance of online-DHP.

15 pages, 10 figures, published on TPAMI 2018

Related Organizations

Beihua University
China (People's Republic of)
Beihang University
China (People's Republic of)

Keywords

Adult, Male, FOS: Computer and information sciences, Computer Science - Machine Learning, Models, Statistical, Adolescent, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Video Recording, Machine Learning (cs.LG), Young Adult, Deep Learning, Head Movements, Image Processing, Computer-Assisted, Humans, Female, Algorithms

1 Research products, page 1 of 1

DHP software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	146
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%