Visual Object Tracking Performance Measures Revisited

Publication: Article, Preprint, Other literature type, 01 Mar 2016

Embargo end date: 01 Jan 2015

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Journal: IEEE Transactions on Image Processing, volume 25, pages 1261-1274 (ISSN: 1057-7149, eISSN: 1941-0042)

Authors: Luka Cehovin; Ales Leonardis; Matej Kristan

doi: 10.1109/tip.2016.2520370, 10.48550/arxiv.1502.05803

pmid: 26812723

arXiv: 1502.05803

Abstract

The field of visual tracking evaluation features a large variety of performance measures and suffers from a lack of consensus about which measures should be used in experiments. This makes cross-paper tracker comparison difficult. Furthermore, as some measures may be less effective than others, the tracking results may be skewed or biased towards particular tracking aspects. In this paper, we revisit the popular performance measures and tracker performance visualizations and analyze them theoretically and experimentally. We show that several measures are equivalent from the point of view of the information they provide for tracker comparison and, crucially, that some are more brittle than others. Based on our analysis, we narrow down the set of potential measures to only two complementary ones, describing accuracy and robustness, thus pushing towards homogenization of the tracker evaluation methodology. These two measures can be intuitively interpreted and visualized and have been employed by the recent Visual Object Tracking (VOT) challenges as the foundation for the evaluation methodology.
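
The abstract names the two retained measures, accuracy and robustness, without defining them. As a rough illustration only, the Python sketch below assumes one common reading: accuracy as the average region overlap (intersection-over-union) over frames where the tracker still overlaps the target, and robustness as a count of failure frames. The function names, the `(x, y, width, height)` box format, and the zero-overlap failure rule are assumptions made for this sketch, not definitions taken from this record, and re-initialization after a failure is not modelled.

```python
from typing import List, Optional, Tuple

# Assumed box format for this sketch: (x, y, width, height), axis-aligned.
Box = Tuple[float, float, float, float]


def overlap(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    # Width and height of the intersection rectangle (zero if disjoint).
    iw = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def accuracy_and_robustness(
    predicted: List[Optional[Box]], ground_truth: List[Box]
) -> Tuple[float, int]:
    """Return (average overlap on non-failure frames, number of failure frames).

    Assumption: a frame is a failure when the tracker reports no box (None)
    or its box has zero overlap with the ground truth.
    """
    overlaps = []
    failures = 0
    for pred, truth in zip(predicted, ground_truth):
        value = overlap(pred, truth) if pred is not None else 0.0
        if value == 0.0:
            failures += 1
        else:
            overlaps.append(value)
    accuracy = sum(overlaps) / len(overlaps) if overlaps else 0.0
    return accuracy, failures


if __name__ == "__main__":
    truth = [(0.0, 0.0, 10.0, 10.0)] * 3
    pred = [(0.0, 0.0, 10.0, 10.0), (5.0, 0.0, 10.0, 10.0), (20.0, 20.0, 10.0, 10.0)]
    print(accuracy_and_robustness(pred, truth))
```

Reporting the two numbers separately, rather than folding them into one score, mirrors the abstract's point that the measures are complementary: a tracker can overlap the target well while it tracks and still fail often, or the reverse.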

Related Organizations

University of Ljubljana
Slovenia

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Impact by BIP!

	selected citations: 130. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
	popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
	influence: Top 1%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
	impulse: Top 1%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.