Proceedings of the 24th International Conference on World Wide Web 2015
DOI: 10.1145/2736277.2741116
|View full text |Cite
|
Sign up to set email alerts
|

Future User Engagement Prediction and Its Application to Improve the Sensitivity of Online Experiments

Abstract: Modern Internet companies improve their services by means of data-driven decisions that are based on online controlled experiments (also known as A/B tests). To run more online controlled experiments and to get statistically significant results faster are the emerging needs for these companies. The main way to achieve these goals is to improve the sensitivity of A/B experiments. We propose a novel approach to improve the sensitivity of user engagement metrics (that are widely used in A/B tests) by utilizing pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
18
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(18 citation statements)
references
References 27 publications
(120 reference statements)
0
18
0
Order By: Relevance
“…Online controlled experiments, such as A/B testing or interleaving, have become widely used techniques for controlling and improving search quality based on data-driven decisions [19]. This methodology has been adopted widely [3,7,9,31]. An A/B test is a between-subject test designed to compare two variants of a method (e.g., ranking on the SERP, ad ranking, colors and fonts of the web result title) at the same time by exposing them to two user groups and by measuring the difference between them in terms of a key metric (e.g., revenue, number of visits, etc.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Online controlled experiments, such as A/B testing or interleaving, have become widely used techniques for controlling and improving search quality based on data-driven decisions [19]. This methodology has been adopted widely [3,7,9,31]. An A/B test is a between-subject test designed to compare two variants of a method (e.g., ranking on the SERP, ad ranking, colors and fonts of the web result title) at the same time by exposing them to two user groups and by measuring the difference between them in terms of a key metric (e.g., revenue, number of visits, etc.…”
Section: Background and Related Workmentioning
confidence: 99%
“…There are many existing studies towards better online evaluation that are devoted to improving the sensitivity of our measurement methods [28], inventing new metrics [8,10] or improving existing ones [9]. An important goal of recent studies is to make metrics more consistent with long-term goals [19].…”
Section: Background and Related Workmentioning
confidence: 99%
“…While loyalty and popularity essentially make sense for relative comparison of websites, activity enables measuring engagement for a particular website independently of other websites. The most commonly used activity metrics include number of queries per session, number of clicks, number of clicks per query, dwell (presence) time (see e.g., [10,32]). …”
Section: Related Workmentioning
confidence: 99%
“…Recently online controlled experiments, such as A/B testing, have become widely used a technique for controlling and improving search quality based on data-driven decisions [155]. This methodology has been adopted by many leading search companies such as Bing [62], Google [223], Facebook [20], and Yandex [71]. An A/B test is designed to compare two variants of a method (e. g. ranking on SERP, ads ranking at the same time by exposing them to two user groups and by measuring the difference between them in terms of a key metric (e. g. the revenue, the number of visits, etc.…”
Section: Search Quality Evaluationmentioning
confidence: 99%
“…), also known as an overall evaluation criterion. There are many existing studies towards better online evaluation which were devoted to inventing new metrics [70,73] or improving existing ones [71]. The main goal of these studies was to make these metrics more consistent with the long-term goals [155].…”
Section: Search Quality Evaluationmentioning
confidence: 99%