Handbook of Neural Computation 2017
DOI: 10.1016/b978-0-12-811318-9.00020-x
|View full text |Cite
|
Sign up to set email alerts
|

Scene Understanding Using Deep Learning

Abstract: Deep learning is a type of machine perception method that attempts to model highlevel abstractions in data and encode them into a compact and robust representation. Such representations have found immense usage in applications related to computer vision. In this chapter we introduce two such applications, i.e., semantic segmentation of images and action recognition in videos. These applications are of fundamental importance for human-centered environment perception.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(14 citation statements)
references
References 26 publications
0
12
0
Order By: Relevance
“…In contrast to the convolutional layers, fully-connected layers comprise more parameters that lead to them being harder to train [38]. The last layer for prediction in the CNN architecture is the regression layer that makes the relationship between an ideal target and one or more independent features (the input data) [30,39].…”
Section: Convolutional Neural Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…In contrast to the convolutional layers, fully-connected layers comprise more parameters that lead to them being harder to train [38]. The last layer for prediction in the CNN architecture is the regression layer that makes the relationship between an ideal target and one or more independent features (the input data) [30,39].…”
Section: Convolutional Neural Networkmentioning
confidence: 99%
“…Training a CNN means that the model has to be learned the best weights and biases in each layer for minimizing a defined cost function. The procedure of the minimization of a cost function is executed repetitively, utilizing a gradient descent technique that includes the computation of fractional derivatives from the cost function [38].…”
Section: Convolutional Neural Networkmentioning
confidence: 99%
“…In addition, deep learning has an active merit of transfer learning that a myriad of pretraining networks and public datasets have provided benefits for training numerous traffic scenes. For vehicle-related scenes, to understand the objects, scenes, and events in the video, the deep learning neural network attempts to emulate the high-level abstraction in the data and encode it as a robust representation (Husain, et al, 2017).…”
Section: Background and Motivationmentioning
confidence: 99%
“…This thesis focuses on the most popular variation of feed-forward ANN called Convolutional Neural Network (CNN). CNN provides solutions to a range of problems, from computer vision [4][5][6][7][8][9][10][11], raw audio generation [12], to general data analytics [13], to name a few. Each network consists of a repeating set of layers (Convolution, Non-linear Activation, Pooling, Fully Connected) interconnected sequentially.…”
Section: Convolutional Neural Networkmentioning
confidence: 99%
“…Convolutional Neural Networks (CNN) provide solutions to a range of problems, from computer vision [4][5][6][7][8][9][10][11], raw audio generation [12], to general data analytics [13], to name some. Each network consists of a set of layers, where each layer transforms its input using a set of filters and produces a multi-channel output.…”
Section: Introductionmentioning
confidence: 99%