2004
DOI: 10.1145/984321.984324

Anchor text mining for translation of Web queries

Abstract: To discover translation knowledge in diverse data resources on the Web, this article proposes an effective approach to finding translation equivalents of query terms and constructing multilingual lexicons through the mining of Web anchor texts and link structures. Although Web anchor texts are wide-scoped hypertext resources, not every particular pair of languages contains sufficient anchor texts for effective extraction of translations for Web queries. For more generalized applications, the approach is design…

Cited by 61 publications (37 citation statements)
References 8 publications
“…Due to rapid increase of data around the web, it is very important to study how only the relevant data from different web sources can be extracted and processed for relevant tasks. Since most web pages contain tags and other non-content HTML characters [20], it is easy to extract content from any web page if we can exploit these tags. In our approach, we exploited HTML and XTML tags and used python library for web scrapping to extract the reviews from different websites.…”
Section: B Extraction Of Reviews From Web | mentioning
confidence: 99%
“…Lu et al (2004) extracted translation pairs by mining web anchor texts and link structures. As an alternative, mixed-language webpages are analyzed by first retrieving texts that include both source and target languages from the web via search engines or simple rules, and then extracting translation pairs from the mixed-language texts utilizing various clues.…”
Section: Related Studies | mentioning
confidence: 99%
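
The statement above mentions mining anchor texts and link structures. As a rough illustration only, the sketch below groups anchor texts by the URL they point to, on the assumption that anchor texts in different languages sharing a link target are candidate translations of each other. The class `AnchorCollector`, the function `group_anchor_texts`, and the sample pages are hypothetical; this is not the scoring model of the cited article.

```python
from collections import defaultdict
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collect (href, anchor text) pairs from one HTML page."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._href = None   # href of the <a> currently open, if any
        self._chunks = []   # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._chunks = []

    def handle_data(self, data):
        if self._href is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = "".join(self._chunks).strip()
            if text:
                self.pairs.append((self._href, text))
            self._href = None


def group_anchor_texts(pages):
    """Map each link target to the set of anchor texts pointing to it."""
    by_target = defaultdict(set)
    for html in pages:
        collector = AnchorCollector()
        collector.feed(html)
        collector.close()
        for href, text in collector.pairs:
            by_target[href].add(text)
    return dict(by_target)


# Hypothetical sample pages: two links to the same target in two languages.
pages = [
    '<p><a href="http://example.com/">Example Corp</a></p>',
    '<p><a href="http://example.com/">範例公司</a></p>',
]
groups = group_anchor_texts(pages)
```

Here `groups` holds one entry whose anchor-text set contains both strings, which is the kind of co-occurrence a translation-extraction step could then score.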
“…Lu, Chien and Lee [5] proposed a transitive translation approach to find translation equivalents of query terms and constructing multilingual lexicons through the mining of web anchor texts and link structures. The translation equivalents of a query term can be extracted via its translation in an intermediate language.…”
Section: Transitive Translation Approach | mentioning
confidence: 99%
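
The idea of reaching a translation through an intermediate language can be sketched as a composition of two scored lexicons. The example below assumes each lexicon maps a term to candidate translations with a probability-like score, and it combines them by summing over intermediate terms. The function name `transitive_scores`, the placeholder terms, and the scores are all hypothetical, and the combination rule is an assumption rather than the formula used in the cited article.

```python
from collections import defaultdict


def transitive_scores(src_to_mid, mid_to_tgt):
    """Score target-language candidates for one source term.

    src_to_mid: {intermediate term: score} for the source term.
    mid_to_tgt: {intermediate term: {target term: score}}.
    Each target's score is the sum over intermediate terms of
    score(source -> intermediate) * score(intermediate -> target).
    """
    scores = defaultdict(float)
    for mid, p_sm in src_to_mid.items():
        for tgt, p_mt in mid_to_tgt.get(mid, {}).items():
            scores[tgt] += p_sm * p_mt
    return dict(scores)


# Hypothetical toy lexicons with placeholder terms.
src_to_mid = {"mid_a": 0.75, "mid_b": 0.25}
mid_to_tgt = {
    "mid_a": {"tgt_x": 0.5, "tgt_y": 0.5},
    "mid_b": {"tgt_x": 1.0},
}

scores = transitive_scores(src_to_mid, mid_to_tgt)
best = max(scores, key=scores.get)
```

In this toy data `tgt_x` is supported by both intermediate terms, so it accumulates the higher score; a candidate reachable through only one weak path would rank lower.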