2018
DOI: 10.48550/arxiv.1812.05339
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems

Xiaoning Du,
Xiaofei Xie,
Yi Li
et al.

Abstract: Deep learning (DL) defines a data-driven programming paradigm that automatically composes the system decision logic from the training data. In company with the data explosion and hardware acceleration during the past decade, DL achieves tremendous success in many cutting-edge applications. However, even the state-of-the-art DL systems still suffer from quality and reliability issues. It was only until recently that some preliminary progress was made in testing feed-forward DL systems.In contrast to feed-forwar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 28 publications
0
10
0
Order By: Relevance
“…Testing of machine learning algorithms, especially for sophisticated algorithms like deep learning systems, requires specific testing approaches. To that end, recently several adequacy criteria for deep learning systems like neuron coverage [24], surprise adequacy [19] or criteria derived from modeling deep learning systems as abstract state transition systems [8] were defined. However, testing algorithms is not sufficient as the integration of algorithms into systems can be complex, leading to problems and defects being injected along the way.…”
Section: Discussionmentioning
confidence: 99%
“…Testing of machine learning algorithms, especially for sophisticated algorithms like deep learning systems, requires specific testing approaches. To that end, recently several adequacy criteria for deep learning systems like neuron coverage [24], surprise adequacy [19] or criteria derived from modeling deep learning systems as abstract state transition systems [8] were defined. However, testing algorithms is not sufficient as the integration of algorithms into systems can be complex, leading to problems and defects being injected along the way.…”
Section: Discussionmentioning
confidence: 99%
“…To test audio-based deep learning systems, Du et al [78] designed a set of transformations tailored to audio inputs considering background noise and volume variation. They first abstracted and extracted a probabilistic transition model from an RNN.…”
Section: Domain-specific Test Input Synthesismentioning
confidence: 99%
“…We will introduce more related work about using domain-specific metamorphic relations of testing autonomous driving, Differentiable Neural Computer (DNC) [93], machine translation systems [123], [124], biological cell classification [79], and audio-based deep learning systems [78] in Section 8.…”
Section: Metamorphic Relations As Test Oraclesmentioning
confidence: 99%
“…In addition to CV models, natural language processing (NLP) models and their critical applications in machine translation have also been tested [19,41,42,76]. We also notice recent works on testing RNNs and RL models [28,29,40,45,80]. MDPFuzzer tests FNNs, RL, IL, and MARL models for solving MDPs, where existing efforts in testing FNNs and RNNs are not applicable, as will be discussed in Sec.…”
Section: Related Workmentioning
confidence: 99%