“…However, while a number of deep learning algorithms have been successfully shown to solve numerous end-stage problems like prediction and classification (Glorot, Bordes & Bengio, 2011; LeCun, Bengio & Hinton, 2015; Abadi et al., 2016), very few attempts have been made to use them for solving the intermediate problems of data pre-processing (Kotsiantis, Kanellopoulos & Pintelas, 2006; García, Luengo & Herrera, 2015), cleaning (Kotsiantis, Kanellopoulos & Pintelas, 2006; García, Luengo & Herrera, 2015), and restoration (Efron, 1994; Lakshminarayan et al., 1996), even though from a machine learning perspective these end-stage and intermediate problems can be very similar. Long Short-Term Memory (LSTM) networks have previously been proposed as a solution to these intermediate problems (Zhou & Huang, 2017; Sucholutsky et al., 2019), but they suffer from major bottlenecks, such as requiring large numbers of sequential operations that cannot be parallelized. Recently, the Transformer (Vaswani et al., 2017), a novel encoder-decoder model that relies heavily on attention mechanisms (Luong, Pham & Manning, 2015), was proposed as a replacement for encoder-decoder models that use LSTM or convolutional layers, and was shown to achieve state-of-the-art translation results with orders of magnitude fewer parameters than existing models.…”
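To make the parallelization contrast concrete, the sketch below shows scaled dot-product attention, the core operation of the Transformer (Vaswani et al., 2017). It is a minimal NumPy illustration, not the cited authors' implementation; the function name and toy dimensions are chosen here for clarity. Because every position attends to every other position through a single matrix product, the whole sequence is processed at once, whereas an LSTM must step through positions sequentially.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention (illustrative sketch).

    Q, K: (seq_len, d_k) query and key matrices; V: (seq_len, d_v) values.
    All positions are handled in one matrix product, so the computation
    parallelizes across the sequence, unlike an LSTM's recurrence.
    """
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                      # (seq_len, seq_len) similarities
    scores -= scores.max(axis=-1, keepdims=True)         # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)       # row-wise softmax
    return weights @ V                                   # weighted sum of values

# Toy self-attention example: 4 positions, 8-dimensional embeddings.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (4, 8)
```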