2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.12
|View full text |Cite
|
Sign up to set email alerts
|

Neural Module Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
800
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 830 publications
(801 citation statements)
references
References 23 publications
1
800
0
Order By: Relevance
“…In contrast to VQA, we use synthetic images and emphasize representing a broad range of language phenomena. Our motivation is similar to that of SHAPES (Andreas et al, 2016b) and CLEVR (Johnson et al, 2016). Both datasets also use synthetic images and emphasize representing diverse spatial language.…”
Section: Related Work and Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…In contrast to VQA, we use synthetic images and emphasize representing a broad range of language phenomena. Our motivation is similar to that of SHAPES (Andreas et al, 2016b) and CLEVR (Johnson et al, 2016). Both datasets also use synthetic images and emphasize representing diverse spatial language.…”
Section: Related Work and Datasetsmentioning
confidence: 99%
“…NMN The neural module networks approach of Andreas et al (2016b). We experiment with the default maximum leaves of two, and with allowing for more expressive representations with a maximum leaves of five.…”
Section: Image Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…Our approach is inspired by recent work in using neural networks to infer programs expressed in some high-level language, e.g., to answer question involving complex arithmetic, logical, or semantic parsing operations [33]- [41]. Approaches, such as [42], [43], produce programs composed of functions that perform compositional reasoning on an image using an execution engine consisting of neural modules [44]. Similarly, our method produces a program consisting of shape modeling instructions to match a target image by incorporating a shape renderer.…”
Section: Neural Program Inductionmentioning
confidence: 99%
“…In this work, to address the limitations of the existing works, we propose a novel NEP (Neural Embedding Propagation) framework for SSL over heterogeneous networks. Figure 1 gives a running toy example of NEP, which is a powerful yet efficient neural framework that coherently combines an object encoder [16], [17] and a modular network [24], [25]. It leverages the compositional nature of metapaths and trivially generalizes to attributed networks.…”
Section: Introductionmentioning
confidence: 99%