Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery &Amp; Data Mining 2021
DOI: 10.1145/3447548.3467147
|View full text |Cite
|
Sign up to set email alerts
|

Pre-trained Language Model based Ranking in Baidu Search

Abstract: As the heart of a search engine, the ranking system plays a crucial role in satisfying users' information demands. More recently, neural rankers fine-tuned from pre-trained language models (PLMs) establish state-of-the-art ranking effectiveness. However, it is nontrivial to directly apply these PLM-based rankers to the large-scale web search system due to the following challenging issues: (1) the prohibitively expensive computations of massive neural PLMs, especially for long texts in the web-document, prohibi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 50 publications
(26 citation statements)
references
References 44 publications
0
26
0
Order By: Relevance
“…Note that a passage p consists of ๐‘‡ sentences p = {s ๐œ } ๐‘‡ ๐œ=1 . Following a previous study [52], a desirable re-ranker is a scoring function ๐‘“ * (โ€ข, โ€ข) that maximizes the consistency between its predictions (denoted as ลถq,P = {๐‘“ (q, p ๐œ… ) | p ๐œ… โˆˆ P}) and the ground truth labels (denoted as ๐‘Œ = {๐‘ฆ ๐œ… } ๐œ˜ ๐œ…=1 ), i.e.,…”
Section: Problem Formulation 31 Passage Re-rankingmentioning
confidence: 99%
See 1 more Smart Citation
“…Note that a passage p consists of ๐‘‡ sentences p = {s ๐œ } ๐‘‡ ๐œ=1 . Following a previous study [52], a desirable re-ranker is a scoring function ๐‘“ * (โ€ข, โ€ข) that maximizes the consistency between its predictions (denoted as ลถq,P = {๐‘“ (q, p ๐œ… ) | p ๐œ… โˆˆ P}) and the ground truth labels (denoted as ๐‘Œ = {๐‘ฆ ๐œ… } ๐œ˜ ๐œ…=1 ), i.e.,…”
Section: Problem Formulation 31 Passage Re-rankingmentioning
confidence: 99%
“…To this end, we mimic human judgment and only focus on the sentence of each passage that is the most related to a query [52].…”
Section: Algorithm 1: Meta-graph Construction Algorithmmentioning
confidence: 99%
“…In this section, we introduce recent works designing PTMs tailored for IR (Lee et al, 2019b;Chang et al, 2019;Ma et al, 2021b;Ma et al, 2021c;Boualili et al, 2020;Ma et al, 2021d;Zou et al, 2021;Liu et al, 2021d). General pre-trained models like BERT have achieved great success when applied to IR tasks on both the firststage retrieval and the re-ranking stage.…”
Section: Keyphrase Extractionmentioning
confidence: 99%
“…Despite the above-mentioned bi-encoder models, interactionbased methods are also widely used in many information retrieval systems [9,10,44,[48][49][50][51][52]. As such, another line of research for semantic matching is to model query-document interaction with DNNs [25,28,40,44,53]. However, they cannot cache the document embeddings offline, and thus are inefficient for retrieval.…”
Section: Related Work 21 Semantic Retrieval In Web Searchmentioning
confidence: 99%
“…They are preferred for ranking stage, which will not be further discussed in this paper. In our search engine, interaction-based methods are exploited to build the PLM-based ranking system [53].…”
Section: Related Work 21 Semantic Retrieval In Web Searchmentioning
confidence: 99%