Proceedings of the 30th ACM International Conference on Information &Amp; Knowledge Management 2021
DOI: 10.1145/3459637.3481941
|View full text |Cite
|
Sign up to set email alerts
|

One Model to Serve All

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 122 publications
(27 citation statements)
references
References 38 publications
0
27
0
Order By: Relevance
“…However, the challenges encountered by multi-task learning are more complicated in the multi-domain scenario. The task label space of different input data sources is not consistent, so all domains sharing one multi-task model will cause the "multi-domain seesaw" problem [31]. In real industrial scenarios, the cost of training different multi-task models in different domains is unacceptable, and it is impossible to explore correlations between different domains to improve performance.…”
Section: Relate Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the challenges encountered by multi-task learning are more complicated in the multi-domain scenario. The task label space of different input data sources is not consistent, so all domains sharing one multi-task model will cause the "multi-domain seesaw" problem [31]. In real industrial scenarios, the cost of training different multi-task models in different domains is unacceptable, and it is impossible to explore correlations between different domains to improve performance.…”
Section: Relate Workmentioning
confidence: 99%
“…However, mixing all the data directly and training with a unified model ignores the differences between domains and tasks. The inability to align embeddings with different semantics will result in domain seesaw [31] due to the different distributions of user behaviors and item candidates in multiple scenarios. Since different targets have distinctive sparsity and influence each other, the inability to balance the targets of multiple tasks can lead to the task seesaw [33].…”
Section: Introductionmentioning
confidence: 99%
“…More specifically, each scenario has a domain layer that only uses its own data to adjust parameters. Another classical approach STAR [20] added an extra shared tower and proposed a novel parameter fusion method. It multiplies the shared parameters of the extra tower and the customized parameter matrix of the specific scenario to obtain the final network processing parameters.…”
Section: Multi-tower Paradigmmentioning
confidence: 99%
“…• STAR [20] proposes a star topology to accommodate with the scenario-specific characteristics. Specifically, a shared network works as the center node for knowledge sharing and each scenario network connects only with the center node.…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation