2019
DOI: 10.1016/j.is.2018.12.003
|View full text |Cite
|
Sign up to set email alerts
|

Wiser: A semantic approach for expert finding in academia based on entity linking

Abstract: We present WISER, a new semantic search engine for expert finding in academia. Our system is unsupervised and it jointly combines classical language modeling techniques, based on text evidences, with the Wikipedia Knowledge Graph, via entity linking.WISER indexes each academic author through a novel profiling technique which models her expertise with a small, labeled and weighted graph drawn from Wikipedia. Nodes in this graph are the Wikipedia entities mentioned in the author's publications, whereas the weigh… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0
2

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(29 citation statements)
references
References 38 publications
0
27
0
2
Order By: Relevance
“…Finally, we mention the problem of testing our approach and the other known ones over datasets of a different type than news and over other promising domains, such as expert finding in academia . Despite a different kind of textual data, our system Swat could actually be able to correctly detect the salient entities both in web pages and research papers in a similar fashion as already done with news: classical information extraction systems designed their algorithms only on the top of positional and frequency signals (which are also deployed by Swat ), but without using Wikipedia entities (and relative annotation‐ and relatedness‐based features) as salient elements.…”
Section: Discussionmentioning
confidence: 99%
“…Finally, we mention the problem of testing our approach and the other known ones over datasets of a different type than news and over other promising domains, such as expert finding in academia . Despite a different kind of textual data, our system Swat could actually be able to correctly detect the salient entities both in web pages and research papers in a similar fashion as already done with news: classical information extraction systems designed their algorithms only on the top of positional and frequency signals (which are also deployed by Swat ), but without using Wikipedia entities (and relative annotation‐ and relatedness‐based features) as salient elements.…”
Section: Discussionmentioning
confidence: 99%
“…Most of the generative probabilistic models and voting models cannot mine the hidden information that can be found in social networks and that cannot be mined from the documents, and they are 'one-step' relevance models [4]. The authors of reference [46] deployed entity linking, relatedness, and entity embeddings to create a novel WEM profile for academia experts. They built their model based on a weighted and labeled graph drawn from Wikipedia.…”
Section: Network-based Modelsmentioning
confidence: 99%
“…It contains both Dutch and English documents, and it has five different ground-truths-GT1 to GT5. GT5 is considered the most recent and complete ground-truth [46]. Additionally, CiteSeer and DBLP offer the information of computer scientific literatures on main journals and conference proceedings.…”
Section: Rq5: What Are the Datasets That Have Been Used For Expert Fimentioning
confidence: 99%
“…For example, an application such as question answering may be improved by relying on relation extraction (RE) (Hu et al, 2019;Yu et al, 2017), coreference resolution (Bhattacharjee et al, 2020;Gao et al, 2019), named entity recognition (NER) (Molla et al, 2006;Singh et al, 2018), and entity linking (EL) (Broscheit, 2019;Chen et al, 2017) components. This also holds for other applications such as personalized news recommendation (Karimi et al, 2018;Wang et al, 2018Wang et al, , 2019, fact checking (Thorne & Vlachos, 2018;Zhang & Ghorbani, 2020), opinion mining (Sun et al, 2017), semantic search (Cifariello et al, 2019), and conversational agents (Roller et al, 2020). The last decade has shown a growing interest in IE datasets suitably annotated for developing multi-task models where each of the tasks (e.g., NER, RE, etc.)…”
Section: Introductionmentioning
confidence: 95%