2016
DOI: 10.1016/j.jbi.2016.09.002
|View full text |Cite
|
Sign up to set email alerts
|

Automated learning of domain taxonomies from text using background knowledge

Abstract: In this paper, we present an automated method for taxonomy learning, focusing on concept formation and hierarchical relation learning. To infer such relations, we partition the extracted concepts and group them into closely-related clusters using Hierarchical Agglomerative Clustering, informed by syntactic matching and semantic relatedness functions. We introduce a novel, unsupervised method for cluster detection based on automated dendrogram pruning, which is dynamic to each partition. We evaluate our approac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(16 citation statements)
references
References 32 publications
0
16
0
Order By: Relevance
“…However, creating gold standard labeled datasets is laborious and subjective. Hoxha et al [28] recently developed Ontofier, an unsupervised ontology learning framework that uses the agglomerative hierarchical clustering to learn domain taxonomies. They used agglomerative hierarchical clustering to produce a dendrogram (i.e., a tree diagram) that can be pruned to be a taxonomy with parent-child relations.…”
Section: Related Workmentioning
confidence: 99%
“…However, creating gold standard labeled datasets is laborious and subjective. Hoxha et al [28] recently developed Ontofier, an unsupervised ontology learning framework that uses the agglomerative hierarchical clustering to learn domain taxonomies. They used agglomerative hierarchical clustering to produce a dendrogram (i.e., a tree diagram) that can be pruned to be a taxonomy with parent-child relations.…”
Section: Related Workmentioning
confidence: 99%
“…The other table is Concept Table (CT), which continues adding concepts to the table as new undefined words are added. [8] At the point when the words captured are feed from the table, the quantity of words is calculated. Checking the count to be lesser than the quantity of words acquired for the table, the technique proceeds.…”
Section: Algorithmmentioning
confidence: 99%
“…For example, our group recently developed an open source text mining tool called simiTerm to identify the terms in a text corpus that are contextually and syntactically similar to existing terms in an ontology [11]. Hoxha et al [8] developed Ontofier, an unsupervised ontology learning framework that uses agglomerative hierarchical clustering to learn domain taxonomies from biomedical texts. These methods, which focused on building lightweight ontologies, do not intend to enrich existing ontologies with new concepts and cannot identify the locations for them in the existing ontology.…”
Section: Background and Related Workmentioning
confidence: 99%
“…The traditional top-down approach for ontology development involves iterative discussions among ontology developers and domain experts, which is labor intensive and time consuming [7]. Thus, automated and semi-automated ontology learning methods, which may ease the burden of ontology developers and accelerate ontology development, are highly desired [812].…”
Section: Introductionmentioning
confidence: 99%