Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence 2018
DOI: 10.24963/ijcai.2018/369

Online Deep Learning: Learning Deep Neural Networks on the Fly

Abstract: Deep Neural Networks (DNNs) are typically trained by backpropagation in a batch learning setting, which requires the entire training data to be made available prior to the learning task. This is not scalable for many real-world scenarios where new data arrives sequentially in a stream form. We aim to address an open challenge of "Online Deep Learning" (ODL) for learning DNNs on the fly in an online setting. Unlike traditional online learning that often optimizes some convex objective function with respect to a…
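For readers less familiar with the setting, here is a minimal sketch of the online protocol the abstract contrasts with batch training: the model sees each example exactly once, predicts, and immediately takes one backpropagation step. It is written in PyTorch under invented names (SimpleNet, online_train) and illustrative hyperparameters, not the paper's setup:

```python
import torch
import torch.nn as nn

class SimpleNet(nn.Module):
    """Small MLP classifier; an illustrative stand-in, not the paper's model."""
    def __init__(self, in_dim, n_classes, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_classes))

    def forward(self, x):
        return self.net(x)

def online_train(model, data_stream, lr=0.01):
    """One-pass training: no stored dataset, one SGD step per arriving example."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for x, y in data_stream:            # (feature tensor, integer label)
        logits = model(x.unsqueeze(0))  # predict before updating
        loss = loss_fn(logits, y.unsqueeze(0))
        opt.zero_grad()
        loss.backward()                 # single backpropagation step
        opt.step()
```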

Cited by 221 publications (136 citation statements)
References 6 publications
“…The structural learning scenario is mainly driven by feature similarity and does not fully operate in the one-pass learning mode. [12] puts forward the hedge backpropagation method to answer the research question as to how and when a DNN structure should be adapted. This work, however, assumes that an initial structure of DNN exists and is built upon a fixed-capacity network.…”
Section: Related Work
confidence: 99%
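The excerpt hinges on what hedge backpropagation does, so a compact sketch may help: in the cited method, each hidden layer feeds its own output classifier, the final prediction is an alpha-weighted vote over those classifiers, and the alphas are updated multiplicatively with the Hedge rule (each depth's weight decays by beta raised to its current loss). The PyTorch sketch below is an assumption-laden illustration; the layer widths, beta value, and the class name HedgeNet are invented, not the paper's exact configuration:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HedgeNet(nn.Module):
    def __init__(self, in_dim, n_classes, hidden=(64, 64, 64), beta=0.99):
        super().__init__()
        self.beta = beta
        dims = (in_dim,) + tuple(hidden)
        self.layers = nn.ModuleList(
            nn.Linear(dims[i], dims[i + 1]) for i in range(len(hidden)))
        self.heads = nn.ModuleList(nn.Linear(h, n_classes) for h in hidden)
        # One vote weight per depth, initialized uniformly.
        self.alpha = torch.full((len(hidden),), 1.0 / len(hidden))

    def forward(self, x):
        per_depth = []
        h = x
        for layer, head in zip(self.layers, self.heads):
            h = F.relu(layer(h))
            per_depth.append(head(h))      # prediction from this depth
        stacked = torch.stack(per_depth)   # (n_layers, batch, n_classes)
        mixed = (self.alpha.view(-1, 1, 1) * stacked).sum(dim=0)
        return mixed, stacked

    def hedge_update(self, stacked, y):
        # Hedge rule: decay each depth's weight by beta ** its current loss,
        # then renormalize so the alphas stay a probability distribution.
        with torch.no_grad():
            losses = torch.stack([F.cross_entropy(p, y) for p in stacked])
            self.alpha = self.alpha * self.beta ** losses
            self.alpha = self.alpha / self.alpha.sum()
```

A full training step would additionally backpropagate the per-head losses to update the layer weights; the Hedge update alone is what lets the effective depth adapt over the stream, which is the "how and when" question the excerpt refers to.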
“…The key difference lies in the calculation of the mean and variance, which are obtained directly from the bias itself rather than from the binomial distribution, because the hidden-unit growing strategy analyzes a real variable (the bias) instead of the accuracy score. The high-bias condition triggering the introduction of a new hidden unit is formulated as follows:

$$\mu^{t}_{bias} + \sigma^{t}_{bias} \geq \mu^{min}_{bias} + \pi\sigma^{min}_{bias} \tag{4}$$

where $\pi = 1.25\exp(-Bias^2) + 0.75$ controls the confidence degree of the sigma rule. It is observed that $\pi$ is set adaptively as a function of the bias and revolves around $[1, 2]$.…”
Section: Adaptive Learning Strategy of Network Width
confidence: 99%
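Read as a rule, criterion (4) is a simple threshold test on running statistics of the bias. A minimal Python sketch, assuming those statistics are tracked elsewhere; the function and argument names (should_grow, mu_bias_t, and so on) are hypothetical:

```python
import math

def should_grow(mu_bias_t, sigma_bias_t, mu_bias_min, sigma_bias_min, bias):
    # pi from Eq. (4): confidence factor of the sigma rule, adapted to the bias.
    pi = 1.25 * math.exp(-bias ** 2) + 0.75
    # Grow a hidden unit when the current bias statistic drifts above its
    # recorded minimum by a pi-sigma margin.
    return mu_bias_t + sigma_bias_t >= mu_bias_min + pi * sigma_bias_min
```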
“…$$\mu^{t}_{var} + \sigma^{t}_{var} \geq \mu^{min}_{var} + 2\chi\sigma^{min}_{var} \tag{5}$$

Compared to (4), the factor 2 is introduced to avoid pruning a unit directly after one has been added, since the addition of a new hidden unit temporarily increases the network variance, which then gradually decreases as subsequent observations arrive. $\chi$ is set analogously to $\pi$ in (4), as $\chi = 1.25\exp(-Variance^2) + 0.75$, which yields a $k$-sigma rule in the range $[1, 4]$. This strategy spans confidence levels between 68.2% and 99.9%.…”
Section: Adaptive Learning Strategy of Network Width
confidence: 99%
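The pruning test (5) mirrors the growing test, with the extra factor 2 damping pruning right after a unit has been grown. A matching sketch under the same assumptions, with hypothetical names (should_prune and its arguments):

```python
import math

def should_prune(mu_var_t, sigma_var_t, mu_var_min, sigma_var_min, variance):
    # chi mirrors pi in Eq. (4), computed from the variance instead of the bias.
    chi = 1.25 * math.exp(-variance ** 2) + 0.75
    # The factor 2 widens the margin so a freshly grown unit, which temporarily
    # inflates the network variance, is not pruned immediately.
    return mu_var_t + sigma_var_t >= mu_var_min + 2.0 * chi * sigma_var_min
```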