2021
DOI: 10.1007/978-981-16-1288-6_5
|View full text |Cite
|
Sign up to set email alerts
|

Effective and Efficient ROI-wise Visual Encoding Using an End-to-End CNN Regression Model and Selective Optimization

Abstract: In neuroscience, visual encoding based on functional magnetic resonance imaging (fMRI) has been attracting much attention, especially with the recent development of deep learning. Visual encoding model is aimed at predicting subjects' brain activity in response to presented image stimuli . Current visual encoding models firstly extract image features through a pre-trained convolutional neural network (CNN) model, and secondly learn to linearly map the extracted CNN features to each voxel. However, it is hard f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(10 citation statements)
references
References 39 publications
0
10
0
Order By: Relevance
“…Through training an encoding model to construct the mapping from visual stimuli space into fMRI voxel space, we can predict the fMRI voxels from Gallant-pattern images based on the pre-trained encoding model. In this study, we used the pre-trained encoding model for V1, V2, and V3 in our previous work (Qiao, et al 2019b) because the lower-level visual cortices are mainly responsible for the detailed visual information such as texture, location, and so on. Under the Bayesian framework, we evaluated the similarity between generated images and corresponding visual stimuli by measuring the distance between the true and predicted voxels.…”
Section: Overview Of Proposed Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Through training an encoding model to construct the mapping from visual stimuli space into fMRI voxel space, we can predict the fMRI voxels from Gallant-pattern images based on the pre-trained encoding model. In this study, we used the pre-trained encoding model for V1, V2, and V3 in our previous work (Qiao, et al 2019b) because the lower-level visual cortices are mainly responsible for the detailed visual information such as texture, location, and so on. Under the Bayesian framework, we evaluated the similarity between generated images and corresponding visual stimuli by measuring the distance between the true and predicted voxels.…”
Section: Overview Of Proposed Methodsmentioning
confidence: 99%
“…Eventually, thousands of regression models are constructed for several visual ROIs, which is inefficient. In this study, we employed an effective and efficient encoding model as our previous study (Qiao, et al 2019b). As the high-level visual areas are difficult to encode and have bad encoding performance, we employed the encoding model for V1, V2, and V3 to preserve the fidelity of visual reconstruction.…”
Section: Encoding Model Mapping Generated Images To Voxelsmentioning
confidence: 99%
See 1 more Smart Citation
“…Data-driven visual encoding models can realize the nonlinear mapping from the visual stimuli to the brain response and learn the representation directly from the fMRI data in an end-to-end manner [ 26 , 27 , 28 , 29 , 30 , 31 , 32 ]. For example, Seeliger et al [ 31 ] proposed an end-to-end encoding model that simultaneously represents the neural information processing between different visual cortex and shows good fitting ability on the voxel activity of early visual areas.…”
Section: Introductionmentioning
confidence: 99%
“…This nonlinear mapping replaces traditional linear mapping and is intended to improve brain activity prediction accuracy. Qiao et al [2021] proposed an end-to-end CNN regression model for visual encoding based on fMRI data. As a result, this model could develop appropriate feature representations and linear regression weights for visual cortical responses while also significantly improving prediction performance.…”
Section: Introductionmentioning
confidence: 99%