2018 3rd International Conference on Computational Intelligence and Applications (ICCIA) 2018
DOI: 10.1109/iccia.2018.00019
|View full text |Cite
|
Sign up to set email alerts
|

Safely and Quickly Deploying New Features with a Staged Rollout Framework Using Sequential Test and Adaptive Experimental Design

Abstract: During the rapid development cycle for Internet products (websites and mobile apps), new features are developed and rolled out to users constantly. Features with code defects or design flaws can cause outages and significant degradation of user experience. The traditional method of code review and change management can be time-consuming and error-prone. In order to make the feature rollout process safe and fast, this paper proposes a methodology for rolling out features in an automated way using an adaptive ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 18 publications
(20 reference statements)
0
7
0
Order By: Relevance
“…when a regression is detected. Towards this end, sequential tests for difference-in-means of Gaussian random variables have already been widely used for online A/B experiments [11][12][13]27]. However, we argue that performing inference about means is too limited for canary tests, for not all bugs or performance regressions can be captured by differences in the mean alone, as the following example demonstrates.…”
Section: Regression-driven Experimentsmentioning
confidence: 99%
See 2 more Smart Citations
“…when a regression is detected. Towards this end, sequential tests for difference-in-means of Gaussian random variables have already been widely used for online A/B experiments [11][12][13]27]. However, we argue that performing inference about means is too limited for canary tests, for not all bugs or performance regressions can be captured by differences in the mean alone, as the following example demonstrates.…”
Section: Regression-driven Experimentsmentioning
confidence: 99%
“…Johari et al [11,13] proposed an "always-valid" sequential inference framework for differences in the means of Gaussian random variables using the mSPRT to provide confidence sequences and sequential 𝑝-values. In addition to being used in commercial A/B testing software, [27] use this framework for managing the automated rollout of new software features, formulating performance regressions as differences in the mean.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Besides, testing immature features could cause outages or critical harmness on key business metrics. The monitoring on key metrics [20,22,24] can help alarm experimenter at the very early hours and ensure a safe data collection, which can be another interesting application to further explore.…”
Section: Summary and Future Workmentioning
confidence: 99%
“…Both are targeted to minimize false positives in their processes while attaining good recall of the signals. The methodologies involve a novel Population Stability Index (PSI [21]) based test and a sequential probability ratio test (SPRT [4,5,14,18,19,24]). To our knowledge, we are the first to automate the methods (in early 2019) in a large scale experimentation platform to continuously monitor experiment quality.…”
Section: Introductionmentioning
confidence: 99%