2019
DOI: 10.48550/arxiv.1910.03878
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Engineering for a Science-Centric Experimentation Platform

Abstract: Netflix is an internet entertainment service that routinely employs experimentation to guide strategy around product innovations. As Netflix grew, it had the opportunity to explore increasingly specialized improvements to its service, which generated demand for deeper analyses supported by richer metrics and powered by more diverse statistical methodologies. To facilitate this, and more fully harness the skill sets of both engineering and data science, Netflix engineers created a science-centric experimentatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…Experimentation is ubiquitous in online services such as Facebook, LinkedIn [29], Netflix [9], etc., where the effects of product changes are explicitly tested and analyzed in randomized trials. Interference, sometimes referred to as network effects in the context of online social networks, is a threat to the validity of these randomized trials as the presence of interference violates the stable unit treatment value assumption (SUTVA, see e.g.…”
Section: Introductionmentioning
confidence: 99%
“…Experimentation is ubiquitous in online services such as Facebook, LinkedIn [29], Netflix [9], etc., where the effects of product changes are explicitly tested and analyzed in randomized trials. Interference, sometimes referred to as network effects in the context of online social networks, is a threat to the validity of these randomized trials as the presence of interference violates the stable unit treatment value assumption (SUTVA, see e.g.…”
Section: Introductionmentioning
confidence: 99%
“…A/B tests are widely used as the gold standard for understanding how a new algorithm will behave at scale. Almost all large tech companies routinely use A/B tests to evaluate changes before deploying them [18,22,34,48,53,58,63,70,76]. Networking research often includes the results of A/B tests, and uses them to justify new algorithms [17-19, 24, 25, 29, 42, 46, 49, 55, 57, 58, 60, 63, 72, 87].…”
Section: Introductionmentioning
confidence: 99%
“…It is common to find mature software and engineering systems for an experimentation platform (Fabijan et al 2017, Kohavi et al 2013, Deng, Lu, and Litz 2017, Diamantopoulos et al 2019). However, it is much less common to find mature software for statistics and causal inference that integrates into such systems.…”
Section: Introductionmentioning
confidence: 99%