2020
DOI: 10.1145/3417996
|View full text |Cite
|
Sign up to set email alerts
|

Learning Unsupervised Knowledge-Enhanced Representations to Reduce the Semantic Gap in Information Retrieval

Abstract: The semantic mismatch between query and document terms-i.e., the semantic gap-is a long-standing problem in Information Retrieval (IR). Two main linguistic features related to the semantic gap that can be exploited to improve retrieval are synonymy and polysemy. Recent works integrate knowledge from curated external resources into the learning process of neural language models to reduce the effect of the semantic gap. However, these knowledge-enhanced language models have been used in IR mostly for re-ranking … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(16 citation statements)
references
References 58 publications
0
16
0
Order By: Relevance
“…The Scientific Literature document set for TREC PM 2017 (PM17) and 2018 (PM18) consists of a set of 26, 759, 399 MEDLINE abstracts, plus two additional sets: (i) 37, 007 abstracts from recent proceedings of the American Society of Clinical Oncology (ASCO), 8 and (ii) 33, 018 abstracts from recent proceedings of the American Association for Cancer Research (AACR). 9 These additional sets were added to increase the set of potentially relevant treatment information. Indeed, relevant literature articles can guide precision oncologists to the best-known treatment options for the patient's condition.…”
Section: Scientific Literaturementioning
confidence: 99%
See 3 more Smart Citations
“…The Scientific Literature document set for TREC PM 2017 (PM17) and 2018 (PM18) consists of a set of 26, 759, 399 MEDLINE abstracts, plus two additional sets: (i) 37, 007 abstracts from recent proceedings of the American Society of Clinical Oncology (ASCO), 8 and (ii) 33, 018 abstracts from recent proceedings of the American Association for Cancer Research (AACR). 9 These additional sets were added to increase the set of potentially relevant treatment information. Indeed, relevant literature articles can guide precision oncologists to the best-known treatment options for the patient's condition.…”
Section: Scientific Literaturementioning
confidence: 99%
“…Otherwise, to use pyndri, we should first have had indexed the collections using Indri [215] -which is built in C++. 9 As in the original paper, we remove stopwords using the Indri stoplist, 10 and we do not perform stemming.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 2 more Smart Citations
“…The AraPlagDet corpus was used to evaluate their work using precision, recall, and F-score metrics, and achieved 82.4 percent precision, 93.2 percent recall and 87.5 percent F-score. Agosti et al (2020) proposed an unsupervised neural framework for information retrieval. Semantic indexing, synonymy, and polysemy were used to eliminate semantic gaps, indicating a mismatch between document terms and queries.…”
Section: Related Workmentioning
confidence: 99%