Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu 2018
DOI: 10.18653/v1/n18-3011
|View full text |Cite
|
Sign up to set email alerts
|

Construction of the Literature Graph in Semantic Scholar

Abstract: We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. The resulting literature graph consists of more than 280M nodes, representing papers, authors, entities and various interactions between them (e.g., authorships, citations, entity mentions). We reduce literature graph construction into familiar NLP tasks (e.g., entity extraction and linking), point out research challenges due to differences from … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
227
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 285 publications
(228 citation statements)
references
References 24 publications
1
227
0
Order By: Relevance
“…However, first attempts towards a more semantic representation of article content exist: Ammar et al [1] interlink the Semantic Scholar Corpus with DBpedia [25] and Unified Medical Language System (UMLS) [6] using entity linking techniques. Yaman et al [43] connect SciGraph with DBpedia person entities.…”
Section: Applications For Domain-independent Scientific Information Ementioning
confidence: 99%
See 1 more Smart Citation
“…However, first attempts towards a more semantic representation of article content exist: Ammar et al [1] interlink the Semantic Scholar Corpus with DBpedia [25] and Unified Medical Language System (UMLS) [6] using entity linking techniques. Yaman et al [43] connect SciGraph with DBpedia person entities.…”
Section: Applications For Domain-independent Scientific Information Ementioning
confidence: 99%
“…For academic search engines, Xiong et al [42] have shown that exploiting knowledge bases like Freebase can improve search results. However, the introduction of new scientific concepts occurs at a faster pace than knowledge base curation, resulting in a large gap in knowledge base coverage of scientific entities [1], e.g. the task geolocation estimation of photos from the Computer Vision field is neither present in Wikipedia nor in more specialised knowledge bases like Computer Science Ontology (CSO) [35] arXiv:2001.03067v1 [cs.IR] 9 Jan 2020 or "Papers with code" [32].…”
Section: Introductionmentioning
confidence: 99%
“…Selection: Figure 2 displays our systematic approach to select articles relevant to this survey, based on [31]. First, we collect data from DBLP [35] and Semantic Scholar [2]. We filter them by venue, retaining only articles from the 10 key conferences and journals in distributed systems listed in the caption of Table 1, including SC.…”
Section: Article Selection and Labelingmentioning
confidence: 99%
“…We further consider a task of generating abstracts for scientific papers (Ammar et al, 2018), where the input contains a paper title and scientific entities mentioned in the abstract. We use the AGENDA data processed by Koncel-Kedziorski et al (2019), where entities and their relations in the abstracts are extracted by SciIE (Luan et al, 2018).…”
Section: Task Iii: Paper Abstract Generationmentioning
confidence: 99%