Proceedings of the 16th Conference on Computational Linguistics - 1996
DOI: 10.3115/992628.992719

Learning bilingual collocations by word-level sorting

Abstract: This paper proposes a new method for learning bilingual collocations from sentence-aligned parallel corpora. Our method comprises two steps: (1) extracting useful word chunks (n-grams) by word-level sorting and (2) constructing bilingual collocations by combining the word-chunks acquired in stage (1). We apply the method to a very challenging text pair: a stock market bulletin in Japanese and its abstract in English. Domain specific collocations are well captured even if they were not co…
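
Step (1) of the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes whitespace-tokenized text (Japanese would need a word segmenter first), and the function name `frequent_word_ngrams` and the parameters `max_n` and `min_count` are hypothetical. The idea shown is only that, once word-level suffixes are sorted, identical n-grams sit next to each other and can be counted in one pass.

```python
from itertools import groupby


def frequent_word_ngrams(sentences, max_n=3, min_count=2):
    """Count word n-grams (n <= max_n) by sorting word-level suffixes.

    Hypothetical helper for illustration; assumes whitespace tokenization.
    """
    suffixes = []
    for sentence in sentences:
        words = sentence.split()
        # Every word position starts a suffix, truncated to max_n words.
        for i in range(len(words)):
            suffixes.append(tuple(words[i:i + max_n]))

    # After sorting, suffixes that share leading words are adjacent.
    suffixes.sort()

    result = {}
    for n in range(1, max_n + 1):
        # Prefixes of length n stay in sorted order, so equal ones form runs.
        prefixes = [s[:n] for s in suffixes if len(s) >= n]
        for ngram, run in groupby(prefixes):
            count = sum(1 for _ in run)
            if count >= min_count:
                result[ngram] = count
    return result


if __name__ == "__main__":
    corpus = [
        "stock prices rose sharply",
        "stock prices fell",
        "bond prices rose sharply",
    ]
    for ngram, count in sorted(frequent_word_ngrams(corpus).items()):
        print(" ".join(ngram), count)
```

The frequency threshold here stands in for whatever notion of "useful" chunk the paper applies; the abstract does not state that criterion.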

Cited by 19 publications (14 citation statements)
References 12 publications
“…There has been a growing interest in corpus-based approaches which retrieve collocations from large corpora (Nagao and Mori, 1994), (Kupiec, 1993), (Fung, 1995), (Kitamura and Matsumoto, 1996), (Smadja, 1993), (Smadja et al, 1996), (Haruno et al, 1996). Although these approaches achieved good results for the task considered, most of them aim to extract fixed collocations, mainly noun phrases, and require the information which is dependent on each language such as dictionaries and parts of speech.…”
Section: Introduction (mentioning)
confidence: 99%
“…This method retrieves fixed collocations with high accuracy but may ignore collocations of exceptional types. Haruno et al (1996) constructed collocations by iteratively combining a couple of strings of high mutual information. But the mutual information is estimated inadequately low when the cohesiveness between the two strings is greatly different.…”
Section: Related Work (mentioning)
confidence: 99%
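
The remark about mutual information can be made concrete with a small worked example. The sketch below uses pointwise mutual information in its common textbook form; the exact measure used by Haruno et al. is not given in the statement above, and all counts are hypothetical numbers chosen only for illustration. It shows that the same number of co-occurrences yields a much lower score when one of the two strings is far more frequent than the other.

```python
import math


def pmi(count_xy, count_x, count_y, total):
    """Pointwise mutual information (base 2) from raw counts.

    Textbook formulation, assumed here for illustration only.
    """
    p_xy = count_xy / total
    p_x = count_x / total
    p_y = count_y / total
    return math.log2(p_xy / (p_x * p_y))


if __name__ == "__main__":
    total = 10_000  # hypothetical number of observation windows

    # Both strings occur 10 times and always together.
    balanced = pmi(count_xy=10, count_x=10, count_y=10, total=total)

    # One string occurs 1000 times, the other 10 times, still always together.
    skewed = pmi(count_xy=10, count_x=1000, count_y=10, total=total)

    print(f"balanced: {balanced:.2f} bits, skewed: {skewed:.2f} bits")
```

In the second case the rare string never appears without the frequent one, yet the score drops, which is one way to read the concern raised in the quoted statement.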
“…There has been growing interest in corpus-based approaches that retrieve collocations from large corpora (Nagao, Makoto, and Mori 1994; Ikehara, Shirai, and Uchino 1996; Kupiec 1993; Fung 1995; Kitamura and Matsumoto 1996; Smadja 1993; Smadja, McKeown, and Hatzivassiloglou 1996; and Haruno, Ikehara, and Yamazaki 1996). As collocations have a large variety of forms, these approaches focus on fixed collocations depending on their points of view.…”
Section: Introduction (mentioning)
confidence: 99%
“…Since the number of Japanese articles is far greater than that of English articles, this rate with Japanese index terms becomes lower for the similarity lower bounds L_d ≤ 0.4. It is also very important to note that the results of this paper can be easily improved by employing more sophisticated techniques of estimating bilingual compound term correspondences from parallel corpora (e.g., [2]), especially in the performance of selecting appropriate monolingual compound terms in each language.…”
Section: Related Work (mentioning)
confidence: 99%