2010
DOI: 10.1093/comjnl/bxq074
|View full text |Cite
|
Sign up to set email alerts
|

Outlier Detection in Cross-Context Link Discovery for Creative Literature Mining

Abstract: This paper investigates the role of outliers in literature-based knowledge discovery. It shows that detecting interesting outliers which appear in the literature on a given phenomenon can help the expert to find implicit relationships among concepts of different domains. The underlying assumption is that while the majority of articles in the given scientific domain describe matters related to a common understanding of the domain, the exploration of outliers may lead to the detection of scientifically interesti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
36
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 23 publications
(36 citation statements)
references
References 37 publications
0
36
0
Order By: Relevance
“…Results obtained by Sluban et al [10] and by Petrič et al [9] confirm the hypothesis that most bridging terms appear in outlier documents and that by considering only outlier documents the search space for b-term identification can be largely reduced. In this way, we can substantially reduce the search space for finding b-term candidates.…”
Section: Outlier Document Detectionmentioning
confidence: 56%
See 4 more Smart Citations
“…Results obtained by Sluban et al [10] and by Petrič et al [9] confirm the hypothesis that most bridging terms appear in outlier documents and that by considering only outlier documents the search space for b-term identification can be largely reduced. In this way, we can substantially reduce the search space for finding b-term candidates.…”
Section: Outlier Document Detectionmentioning
confidence: 56%
“…An example tool that we have developed in previous research is CrossBee [8], a web-based tool for bridging term (b-term) discovery and exploration, which implements an ensemble based term ranking approach to finding new connections between two predefined domains, represented by two user defined sets of biomedical articles. The research conducted by Petrič et al [9] and Sluban et al [10] complements this research by showing that bridging terms are substantially more frequent in documents that are outlier documents of their own domain, compared to their frequency in normal, non-outlier documents. Analogously to statistics, where an outlier is defined as an observation that falls outside the overall pattern of a distribution [11], an outlier document in the field of LBD is a document that lies outside the main group of documents of its own domain and is, therefore, in two domain settings more similar to the documents of the other explored domain than to the documents of the domain of its origin.…”
Section: Introductionmentioning
confidence: 75%
See 3 more Smart Citations