MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jul 2017Embargo end date: 01 Jan 2017Publisher:IEEEJournal:2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hao Yang 0033; Joey Tianyi Zhou; Jianfei Cai 0001; Yew-Soon Ong;

doi: 10.1109/cvpr.2017.635 , 10.48550/arxiv.1702.08681

arXiv: 1702.08681

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

- Summary
- Subjects
- Metrics

Abstract

Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning using PI only consider instance-level PI (privileged instances), they fail to make use of bag-level PI (privileged bags) available in MIML learning. Therefore, in this paper, we propose a two-stream fully convolutional network, named MIML-FCN+, unified by a novel PI loss to solve the problem of MIML learning with privileged bags. Compared to the previous works on PI, the proposed MIML-FCN+ utilizes the readily available privileged bags, instead of hard-to-obtain privileged instances, making the system more general and practical in real world applications. As the proposed PI loss is convex and SGD compatible and the framework itself is a fully convolutional network, MIML-FCN+ can be easily integrated with state of-the-art deep learning networks. Moreover, the flexibility of convolutional layers allows us to exploit structured correlations among instances to facilitate more effective training and testing. Experimental results on three benchmark datasets demonstrate the effectiveness of the proposed MIML-FCN+, outperforming state-of-the-art methods in the application of multi-object recognition.

Accepted in CVPR 2017

Related Organizations

Agency for Science, Technology and Research
Singapore
Nanyang Technological University
Singapore
Institute of High Performance Computing
Singapore

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	42
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

42

Top 10%

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering