2017
DOI: 10.1007/978-3-319-54184-6_14
|View full text |Cite
|
Sign up to set email alerts
|

Joint Training of Generic CNN-CRF Models with Stochastic Optimization

Abstract: Abstract. We propose a new CNN-CRF end-to-end learning framework, which is based on joint stochastic optimization with respect to both Convolutional Neural Network (CNN) and Conditional Random Field (CRF) parameters. While stochastic gradient descent is a standard technique for CNN training, it was not used for joint models so far. We show that our learning method is (i) general, i.e. it applies to arbitrary CNN and CRF architectures and potential functions; (ii) scalable, i.e. it has a low memory footprint an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 32 publications
0
15
0
Order By: Relevance
“…where T m and F m are the sets of labels and features in the m-th frame in the training dataset. Generally, this minimization problem is intractable and various approximation techniques, such as contrastive divergence (CD) [24] or its variant [25], are employed in existing DNN-CRF approaches. Our problem of important/unimportant people classification, however, basically is small.…”
Section: Trainingmentioning
confidence: 99%
“…where T m and F m are the sets of labels and features in the m-th frame in the training dataset. Generally, this minimization problem is intractable and various approximation techniques, such as contrastive divergence (CD) [24] or its variant [25], are employed in existing DNN-CRF approaches. Our problem of important/unimportant people classification, however, basically is small.…”
Section: Trainingmentioning
confidence: 99%
“…The convolutional layer only connects each neuron of the output to a small region (window) of the input which is referred to as a feature map, thus greatly reducing the number of parameters. CNN has demonstrated itself as one of the most successful tools in the area of computer vision [40][41][42][43][44], and more recently, it also found applications in natural language processing [45][46][47][48][49][50].…”
Section: Introductionmentioning
confidence: 99%
“…CRFs have been used to model important geometric characteristics such as shape, region connectivity, contextual information between regions and so on. For these reasons, there has been a recent trend on exploring the integration of CNN and CRF methods [25], [26], [27], [28]. For example, Liu et.…”
Section: Introductionmentioning
confidence: 99%