Proceedings of the 20th International Conference on Computational Linguistics - COLING '04 2004
DOI: 10.3115/1220355.1220490

Extracting hyponyms of prespecified hypernyms from itemizations and headings in web documents

Abstract: This paper describes a method to acquire hyponyms for given hypernyms from HTML documents on the WWW. We assume that a heading (or explanation) of an itemization (or listing) in an HTML document is likely to contain a hypernym of the items in the itemization, and we try to acquire hyponymy relations based on this assumption. Our method is obtained by extending Shinzato's method (Shinzato and Torisawa, 2004) where a common hypernym for expressions in itemizations in HTML documents is obtained by using statistic…

Cited by 10 publications (6 citation statements)
References 6 publications
“…As seen in previous studies [20,21], hyponymy relations are likely to appear in tables. Likewise, we also expect them to appear in that of statutes.…”
Section: Extraction Of Hyponymy Relations From Tables
Citation type: mentioning
confidence: 82%
“…Shinzato et al [20,21] proposed a method to acquire hyponyms for given hypernyms from HTML documents, assuming that the heading of an itemization in an HTML document is likely to contain a hypernym of the items in the itemization. For an itemization in the HTML format that corresponds to a table in a statute, a similar method could be applied to the statutes as long as a statute is described in a structural format, such as XML.…”
Section: Previous Work On Legal Text Processing
Citation type: mentioning
confidence: 99%
“…Moreover, as discussed previously, the semantic concepts specified in many domain ontologies are structured only in the subsumption manner of super-class and sub-class, rather than the more specific is-a, part-of, and related-to, the ones developed by [31,46] and [136]. Some attempted to describe more specified relations, like [13,103] for is-a, [33,92] for part-of, and [41] for related-to relations only. Tao et al [107,108] made a further progress from these works and portrayed the basic is-a, part-of, and related-to semantic relations in one single computational model for concept representation.…”
Section: Semantic Concept Representation
Citation type: mentioning
confidence: 99%
“…Some research [16][17] tried to create a categorized vocabulary dictionary by using web mining. For example, one category is "programming language" and the vocabularies are "Java", "C", "Perl" and so on.…”
Section: Named Entity Extraction
Citation type: mentioning
confidence: 99%