NSD-Imagery: A benchmark dataset for extending fMRI vision decoding methods to mental imagery

Reese Kneeland; Paul S. Scotti; Ghislain St-Yves; Jesse Breedlove; Kendrick N. Kay; Thomas Naselaris

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1109/cvpr52...

Article . 2025 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2025

License: CC BY NC ND

Data sources: Datacite

Publication: Article, Preprint, Conference object (10 Jun 2025)
Embargo end date: 01 Jan 2025
Publisher: IEEE
Journal: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

doi: 10.1109/cvpr52734.2025.02687, 10.48550/arxiv.2506.06898

arXiv: 2506.06898

Abstract

We release NSD-Imagery, a benchmark dataset of human fMRI activity paired with mental images, to complement the existing Natural Scenes Dataset (NSD), a large-scale dataset of fMRI activity paired with seen images that enabled unprecedented improvements in fMRI-to-image reconstruction efforts. Recent models trained on NSD have been evaluated only on seen image reconstruction. Using NSD-Imagery, it is possible to assess how well these models perform on mental image reconstruction. This is a challenging generalization requirement because mental images are encoded in human brain activity with relatively lower signal-to-noise and spatial resolution; however, generalization from seen to mental imagery is critical for real-world applications in medical domains and brain-computer interfaces, where the desired information is always internally generated. We provide benchmarks for a suite of recent NSD-trained open-source visual decoding models (MindEye1, MindEye2, Brain Diffuser, iCNN, Takagi et al.) on NSD-Imagery, and show that the performance of decoding methods on mental images is largely decoupled from performance on vision reconstruction. We further demonstrate that architectural choices significantly impact cross-decoding performance: models employing simple linear decoding architectures and multimodal feature decoding generalize better to mental imagery, while complex architectures tend to overfit visual training data. Our findings indicate that mental imagery datasets are critical for the development of practical applications, and establish NSD-Imagery as a useful resource for better aligning visual decoding methods with this goal.
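
The cross-decoding setup described in the abstract (train a decoder on seen-image activity, then evaluate it on mental-imagery activity) can be illustrated with a small toy sketch. This is not the pipeline of any of the benchmarked models and it does not use NSD or NSD-Imagery data: everything here is synthetic, and the function names (`fit_ridge`, `simulate`, `mean_feature_corr`), the voxel and feature counts, and the choice to model imagery as an attenuated version of the vision signal are all assumptions made purely for illustration.

```python
import numpy as np

def fit_ridge(X, F, lam=1.0):
    """Closed-form ridge regression mapping voxels X (n, v) to features F (n, k)."""
    v = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(v), X.T @ F)

def mean_feature_corr(F_true, F_pred):
    """Average Pearson correlation between true and decoded feature columns."""
    rs = [np.corrcoef(F_true[:, j], F_pred[:, j])[0, 1]
          for j in range(F_true.shape[1])]
    return float(np.mean(rs))

def simulate(rng, A, n, gain, noise_sd):
    """Synthetic 'brain activity': features F pass through a fixed linear
    encoding A, scaled by `gain`, plus Gaussian noise (toy assumption)."""
    F = rng.standard_normal((n, A.shape[0]))
    X = gain * (F @ A) + noise_sd * rng.standard_normal((n, A.shape[1]))
    return X, F

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 50))           # hypothetical: 8 features, 50 voxels

# Train only on "vision" trials, as NSD-trained decoders are.
X_train, F_train = simulate(rng, A, n=400, gain=1.0, noise_sd=3.0)
W = fit_ridge(X_train, F_train, lam=10.0)

# Held-out "vision" trials versus lower-SNR "imagery" trials (gain < 1).
X_vis, F_vis = simulate(rng, A, n=200, gain=1.0, noise_sd=3.0)
X_img, F_img = simulate(rng, A, n=200, gain=0.3, noise_sd=3.0)

vision_r = mean_feature_corr(F_vis, X_vis @ W)
imagery_r = mean_feature_corr(F_img, X_img @ W)
print(f"vision r = {vision_r:.2f}, imagery r = {imagery_r:.2f}")
```

In this toy setting the same decoder scores lower on the attenuated "imagery" trials than on held-out "vision" trials, which is the kind of gap a benchmark like this is meant to measure on real data. Real imagery signals are unlikely to differ from vision by a simple gain change, so the sketch should not be read as a model of the dataset.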

Published at CVPR 2025

Related Organizations

University of Minnesota Morris
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Quantitative Biology - Neurons and Cognition, Computer Vision and Pattern Recognition (cs.CV), FOS: Biological sciences, Image and Video Processing (eess.IV), Computer Science - Computer Vision and Pattern Recognition, FOS: Electrical engineering, electronic engineering, information engineering, Neurons and Cognition (q-bio.NC), Electrical Engineering and Systems Science - Image and Video Processing, Machine Learning (cs.LG)

Impact by BIP!

- selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
