Video2vec Embeddings Recognize Events when Examples are Scarce

Article English OPEN
Habibian, A.; Mensink, T.; Snoek, C.G.M.;

This paper aims for event recognition when video examples are scarce or even completely absent. The key in such a challenging setting is a semantic video representation. Rather than building the representation from individual attribute detectors and their annotations, w... View more
  • References (84)
    84 references, page 1 of 9

    [1] CMU sphinx open source toolkit for speech recognition. http:// Accessed: 2015-08-03.

    [2] Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid. Labelembedding for attribute-based classification. In CVPR, 2013.

    [3] Y. Aytar, M. Shah, and J. Luo. Utilizing semantic word similarity measures for video retrieval. In CVPR, 2008.

    [4] T. Berg, A. Berg, and J. Shih. Automatic attribute discovery and characterization from noisy web data. In ECCV, 2010.

    [5] S. Bhattacharya, M. M. Kalayeh, R. Sukthankar, and M. Shah. Recognition of complex events: Exploiting temporal dynamics between underlying concepts. In CVPR, 2014.

    [6] D. M. Blei and M. I. Jordan. Modeling annotated data. In SIGIR, 2003.

    [7] L. Bottou. Large-scale machine learning with stochastic gradient descent. In ICCS, 2010.

    [8] X. Chang, Y. Yang, A. G. Hauptmann, E. P. Xing, and Y.-L. Yu. Semantic concept discovery for large-scale zero-shot event detection. In IJCAI, 2015.

    [9] X. Chang, Y.-L. Yu, Y. Yang, and A. G. Hauptmann. Searching persuasively: Joint event detection and evidence recounting with limited supervision. In MM, 2015.

    [10] J. Chen, Y. Cui, G. Ye, D. Liu, and S.-F. Chang. Event-driven semantic concept discovery by exploiting weakly tagged internet images. In ICMR, 2014.

  • Metrics
Share - Bookmark