Explicitly Diverse Visual Question Generation

Publication: Article, 01 Jan 2024

Publisher: Elsevier BV

Journal: Neural Networks, volume 184, page 107002 (issn: 0893-6080)

Authors: Jiayuan Xie; Jiasheng Zheng; Wenhao Fang; Yi Cai; Qing Li

doi: 10.2139/ssrn.4719923, 10.1016/j.neunet.2024.107002

pmid: 39709645

Abstract

Visual question generation involves the generation of meaningful questions about an image. Although we have made significant progress in automatically generating a single high-quality question related to an image, existing methods often ignore the diversity and interpretability of generated questions, which are important for various daily tasks that require clear question sources. In this paper, we propose an explicitly diverse visual question generation model that aims to generate diverse questions based on interpretable question sources. To explicitly perform question generation, our model first extracts the scene graph from the image using the unbiased scene graph generation method, where questions generated based on the scene graphs have interpretable question sources. To ensure the diversity of generated questions, our model selects different subgraphs from the scene graph as question sources. Specifically, we employ a subgraph selector to learn how humans select multiple subgraphs that are suitable for question generation. Finally, our model generates diverse questions based on different selected subgraphs. Extensive experiments on the VQA v2.0 and COCO-QA datasets show that the proposed model outperforms the baselines and is able to interpretably generate diverse questions.
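The pipeline described in the abstract (scene graph, then subgraph selection, then one question per subgraph) can be illustrated with a minimal sketch. This is not the authors' model: the paper uses an unbiased scene graph generator, a learned subgraph selector, and a neural question generator, while the sketch below assumes a hand-written scene graph, a greedy non-overlap heuristic, and a fixed question template. All names (`Triple`, `candidate_subgraphs`, `select_diverse`, `generate_question`) are hypothetical.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Triple:
    """One scene-graph edge: subject --relation--> obj."""
    subject: str
    relation: str
    obj: str


def candidate_subgraphs(scene_graph):
    """List single triples first, then pairs of triples that share an entity."""
    candidates = [(t,) for t in scene_graph]
    for a, b in combinations(scene_graph, 2):
        if {a.subject, a.obj} & {b.subject, b.obj}:
            candidates.append((a, b))
    return candidates


def select_diverse(candidates, k):
    """Greedy stand-in for the learned selector (an assumption):
    keep up to k subgraphs that share no triple with earlier picks."""
    selected, covered = [], set()
    for sub in candidates:
        if len(selected) == k:
            break
        if not covered & set(sub):
            selected.append(sub)
            covered |= set(sub)
    return selected


def generate_question(sub):
    """Template stand-in for the neural generator (an assumption).
    The last triple is the question target; earlier triples become context."""
    *context, target = sub
    prefix = "".join(f"The {t.subject} is {t.relation} the {t.obj}. " for t in context)
    return f"{prefix}What is the {target.subject} {target.relation}?", target.obj


# Hand-written toy scene graph; a real system would extract it from an image.
graph = [
    Triple("man", "riding", "horse"),
    Triple("man", "wearing", "hat"),
    Triple("horse", "standing on", "grass"),
]

for sub in select_diverse(candidate_subgraphs(graph), k=2):
    question, answer = generate_question(sub)
    print(question, "->", answer, "| source:", sub)
```

Because each question is returned together with the subgraph it came from, the question source stays inspectable, which is the interpretability property the abstract emphasizes; choosing non-overlapping subgraphs is only one simple way to force the questions to differ.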

Related Organizations

South China University of Technology
China (People's Republic of)
Hong Kong Polytechnic University
China (People's Republic of)

Keywords

Visual Perception; Humans; Neural Networks, Computer; Algorithms

Impact by BIP!

- Selected citations: 1 (derived from selected sources; an alternative to the "Influence" indicator)
- Popularity: Average (the "current" impact or attention of the article in the research community, based on the underlying citation network)
- Influence: Average (the overall impact of the article in the research community, based on the underlying citation network, diachronically)
- Impulse: Average (the initial momentum of the article directly after its publication, based on the underlying citation network)
