2017
DOI: 10.1186/s12911-017-0465-x
|View full text |Cite
|
Sign up to set email alerts
|

Lightweight predicate extraction for patient-level cancer information and ontology development

Abstract: BackgroundKnowledge engineering for ontological knowledgebases is resource and time intensive. To alleviate these issues, especially for novices, automated tools from the natural language domain can assist in the development process of ontologies. We focus towards the development of ontologies for the public health domain and use patient-centric sources from MedlinePlus related to HPV-causing cancers.MethodsThis paper demonstrates the use of a lightweight open information extraction (OIE) tool to derive accura… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…However, especially in the case of OIE approaches and due to concerns on scaling, the use of syntactic or semantic relation extraction techniques has been relatively sparse, with the exception of a few recent examples aiming at domain-specific knowledge extraction [5][6][7][8][9]. Most domain-specific information extraction approaches are focused primarily on evaluating the efficiency of different triple extraction tools on raw data, not taking useful pre-processing and post-processing strategies into account, thus resulting in a large number of potentially uninformative triples [10][11][12]. There exist a few systems that go beyond triple extraction by implementing a more thorough preprocessing strategy, including coreference resolution or discourse analysis to improve the quality of the extracted triples; however, these do not address the scalability issues that arise from processing large corpora [13,14].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, especially in the case of OIE approaches and due to concerns on scaling, the use of syntactic or semantic relation extraction techniques has been relatively sparse, with the exception of a few recent examples aiming at domain-specific knowledge extraction [5][6][7][8][9]. Most domain-specific information extraction approaches are focused primarily on evaluating the efficiency of different triple extraction tools on raw data, not taking useful pre-processing and post-processing strategies into account, thus resulting in a large number of potentially uninformative triples [10][11][12]. There exist a few systems that go beyond triple extraction by implementing a more thorough preprocessing strategy, including coreference resolution or discourse analysis to improve the quality of the extracted triples; however, these do not address the scalability issues that arise from processing large corpora [13,14].…”
Section: Related Workmentioning
confidence: 99%
“…The most likely reservoir was bats, with evidence that the virus was transmitted to a human through an intermediate host, probably a palm civet or raccoon dog (8,9). In less than a year, SARS-CoV infected 8098 people in 26 countries, of whom 774 died (10,11). Approximately 25% of the patients developed organ failure, most often acute respiratory distress syndrome (ARDS), requiring admission to an intensive care unit (ICU), while the case fatality rate (CFR) was 9.6%.…”
Section: Cord-19 Article Id: D99dbae98cc9705d9b5674bb6eb66560b4434305mentioning
confidence: 99%
“…As early as 2004, Smith and Fellbaum used online health information sources to build the Medical WordNet from two corpora, namely Medical FactNet and Medical BeliefNet [35]. Recent work on semi/fully-automated ontology learning focuses on the extraction of terms and predicates (i.e., triples) from a text corpus [12, 36, 37]. For example, our group recently developed an open source text mining tool called simiTerm to identify the terms in a text corpus that are contextually and syntactically similar to existing terms in an ontology [11].…”
Section: Background and Related Workmentioning
confidence: 99%
“…While Ru et al focused on clinician users, Amith et al focused on patients. Amith et al developed an ontology-based approach for representing knowledge about HPV-causing cancer for patients [2]. Although focusing on cancer related knowledge, their approach can presumably be applied to any other domain for creating ontologies.…”
Section: Summary Of Selected Papers In the Thematic Issuementioning
confidence: 99%
“…These papers cover a wide range of topics including Knowledge and Data Personalization [1,2], Social Media Applications to Healthcare [3,4], Clinical Natural Language Processing [5,6], Patient Safety Analyses [7,8], and Data Mining Using Electronic Health Records [9,10].…”
Section: Summary Of Selected Papers In the Thematic Issuementioning
confidence: 99%