2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.479
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning Human Mind for Automated Visual Classification

Abstract: What if we could effectively read the mind and transfer human visual capabilities to computer vision methods? In this paper, we aim at addressing this question by developing the first visual object classifier driven by human brain signals. In particular, we employ EEG data evoked by visual object stimuli combined with Recurrent Neural Networks (RNN) to learn a discriminative brain activity manifold of visual categories. Afterward, we train a Convolutional Neural Network (CNN)-based regressor to project images … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

12
421
8
2

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 200 publications
(443 citation statements)
references
References 23 publications
12
421
8
2
Order By: Relevance
“…Furthermore, like images, we want our signal network to be translation invariant. Significantly, stacked BLSTM layers which work surprisingly well under complex conditions [8] help in obtaining a high-level encoding of features from within these raw brain signals. To tackle the problem of varying temporal lengths of the brain signals, we follow the work proposed in [8] and represent every input brain EEG signal as a matrix, M ∈ R 440×128 where the rows represent the effective 440 timesteps and the columns represent the 128 channels [8] at each timestep.…”
Section: Proposed Methodologymentioning
confidence: 99%
See 3 more Smart Citations
“…Furthermore, like images, we want our signal network to be translation invariant. Significantly, stacked BLSTM layers which work surprisingly well under complex conditions [8] help in obtaining a high-level encoding of features from within these raw brain signals. To tackle the problem of varying temporal lengths of the brain signals, we follow the work proposed in [8] and represent every input brain EEG signal as a matrix, M ∈ R 440×128 where the rows represent the effective 440 timesteps and the columns represent the 128 channels [8] at each timestep.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…The batch size of 32 was found to work the best for the conducted experiments. The Dataset For our experiments we use the dataset released in [8]. This visual stimuli dataset is a small subset of the ImageNet dataset with 40 classes with 50 images per class and accompanying EEG signal data.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…[14,15] propose a model for natural image reconstruction from fMRI brain signal data recorded from a subject while he observes the original images. There have been similar reports on similar EEG-based reconstructions [16], but the reliability of the studies has met serious controversy [17].…”
Section: Introduction and Related Workmentioning
confidence: 99%