Advances in Conceptual Modeling – Foundations and Applications
DOI: 10.1007/978-3-540-76292-8_9
|View full text |Cite
|
Sign up to set email alerts
|

Seed-Based Generation of Personalized Bio-ontologies for Information Extraction

Abstract: Abstract. Biologists usually focus on only a small, individualized, subdomain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest. Given a seed, t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Publication Types

Select...
2
2
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…However, since automated tools usually require a huge training corpus, making intensive use of natural language processing algorithms (e.g., Aqualog [22], Watson [4]), the results are not always satisfactory [10], especially when there is the necessity to infer a higher morphological or semantic level in information retrieval systems. Pseudo-relevance feedback (PRF) is a technique used to automatically expand search queries by¯nding semantically relevant expansion words in top ranked documents [23].…”
Section: Related Work and State Of The Artmentioning
confidence: 99%
“…However, since automated tools usually require a huge training corpus, making intensive use of natural language processing algorithms (e.g., Aqualog [22], Watson [4]), the results are not always satisfactory [10], especially when there is the necessity to infer a higher morphological or semantic level in information retrieval systems. Pseudo-relevance feedback (PRF) is a technique used to automatically expand search queries by¯nding semantically relevant expansion words in top ranked documents [23].…”
Section: Related Work and State Of The Artmentioning
confidence: 99%
“…Tao and Embley [11] have analyzed how different generelated information is stored in different biological data sources. This study has been very useful for us since they have detected a set of relevant data inconsistencies, data redundancies, some incomplete information, and a set of data defects.…”
Section: Related Workmentioning
confidence: 99%
“…But how this process is done and what format the generated ontologies have is not discussed in the paper. SIH [40] and TANGO [43] are two ongoing projects we are currently working on. SIH tries to generate user-specified ontologies depending on user-generated forms.…”
Section: Ontology Generationmentioning
confidence: 99%
“…Tools such as SIH [40], TANGO [43], and the one developed by Pivk [34] use structured information (HTML tables) as a source for learning ontologies. Structured information makes it easier to interpret new items and relations.…”
Section: Ontology Generationmentioning
confidence: 99%