Rectified factor networks for biclustering of omics data

Clevert, Djork-Arné; Unterthiner, Thomas; Povysil, Gundula; Hochreiter, Sepp

doi:10.1093/bioinformatics/btx226

Cited by 187 publications

(218 citation statements)

References 29 publications

Supporting

Mentioning

218

Contrasting

Order By: Relevance

“…More specifically, we investigated the role of the learning rate, minibatch size, class weight (explained in Section 3.2), and the use of the Exponential Linear Unit (ELU) [45] activation function compared with the Rectified Linear Unit (ReLU) preceded by Batch Normalization [46] (as discussed by Mishkin et al in [47]). Regarding the minibatch size, for some experiments we used batch training (whole data is seen in each update) instead of minibatch training (different data chunks per update).…”

Section: Methodsmentioning

confidence: 99%

Vision-Based Fall Detection with Convolutional Neural Networks

Núñez-Marcos

Azkune

Arganda‐Carreras

2017

Wireless Communications and Mobile Computing

220

125

View full text Add to dashboard Cite

One of the biggest challenges in modern societies is the improvement of healthy aging and the support to older persons in their daily activities. In particular, given its social and economic impact, the automatic detection of falls has attracted considerable attention in the computer vision and pattern recognition communities. Although the approaches based on wearable sensors have provided high detection rates, some of the potential users are reluctant to wear them and thus their use is not yet normalized. As a consequence, alternative approaches such as vision-based methods have emerged. We firmly believe that the irruption of the Smart Environments and the Internet of Things paradigms, together with the increasing number of cameras in our daily environment, forms an optimal context for vision-based systems. Consequently, here we propose a vision-based solution using Convolutional Neural Networks to decide if a sequence of frames contains a person falling. To model the video motion and make the system scenario independent, we use optical flow images as input to the networks followed by a novel three-step training phase. Furthermore, our method is evaluated in three public datasets achieving the state-of-the-art results in all three of them.

show abstract

Section: Methodsmentioning

confidence: 99%

Vision-Based Fall Detection with Convolutional Neural Networks

Núñez-Marcos

Azkune

Arganda‐Carreras

2017

Wireless Communications and Mobile Computing

220

125

View full text Add to dashboard Cite

show abstract

“…The first group are related to ReLU, i.e. LeakyReLU [24] and Very Leaky ReLU [25], RReLU [26],PReLU [27] and its generalized version -APL [28], ELU [29]. Others are based on different ideas, e.g.…”

Section: Activation Functions 311 Previous Workmentioning

confidence: 99%

Systematic evaluation of convolution neural network advances on the Imagenet

Mishkin

Sergievskiy

Matas

2017

Computer Vision and Image Understanding

265

144

View full text Add to dashboard Cite

The paper systematically studies the impact of a range of recent advances in CNN architectures and learning methods on the object categorization (ILSVRC) problem. The evalution tests the influence of the following choices of the architecture: non-linearity (ReLU, ELU, maxout, compatability with batch normalization), pooling variants (stochastic, max, average, mixed), network width, classifier design (convolutional, fully-connected, SPP), image pre-processing, and of learning parameters: learning rate, batch size, cleanliness of the data, etc.The performance gains of the proposed modifications are first tested individually and then in combination. The sum of individual gains is bigger than the observed improvement when all modifications are introduced, but the "deficit" is small suggesting independence of their benefits.We show that the use of 128x128 pixel images is sufficient to make qualitative conclusions about optimal network structure that hold for the full size Caffe and VGG nets. The results are obtained an order of magnitude faster than with the standard 224 pixel images.

show abstract

“…The activation functions of the NNs are chosen such that the NN is able to produce physically reasonable outputs. The ELU function75 used throughout hidden layers is continuously differentiable so that the loading surface predicted by the NN always has continuous derivatives. The predicted fractional loadings of both components are produced by the sigmoid function in the output layer, which ensures that the fractional loadings always satisfies 0 ≤ ŷi ≤ 1, i.e.…”

Section: Methodsmentioning

confidence: 99%