2016
DOI: 10.1016/j.csl.2016.03.002
|View full text |Cite
|
Sign up to set email alerts
|

Sentence alignment using local and global information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 50 publications
0
5
0
Order By: Relevance
“…For building aligned paired sentences, first of all, the aligned English-Persian documents were extracted from Wikipedia. In the second step, in order to extract aligned sentences, a Maximum Entropy binary classifier has been trained in order to evaluate the local similarity between sentence pairs in Persian and English aligned documents [23]. MaxEnt classifiers are log-linear models that try to capture contextual information.…”
Section: -1-1-extracting Parallel Sentencesmentioning
confidence: 99%
See 2 more Smart Citations
“…For building aligned paired sentences, first of all, the aligned English-Persian documents were extracted from Wikipedia. In the second step, in order to extract aligned sentences, a Maximum Entropy binary classifier has been trained in order to evaluate the local similarity between sentence pairs in Persian and English aligned documents [23]. MaxEnt classifiers are log-linear models that try to capture contextual information.…”
Section: -1-1-extracting Parallel Sentencesmentioning
confidence: 99%
“…Barrón-Cedeño, Paramita, Clough, and Rosso have investigated cross-language similarities by incorporating various features such as character n-grams, cognateness, word count ratio, and an approach based on out-links in Wikipedia pages [24]. In this research, a collection of 12 features in four categories were exploited to train the maximum entropy classifier [23]. The four categories are as follows:…”
Section: -1-1-extracting Parallel Sentencesmentioning
confidence: 99%
See 1 more Smart Citation
“…This is somehow similar to our approach based on DTW, although we directly operate at the sentence level. To address the task of aligning documents in which sentences do not appear in the same order, Zamani et al (2016) present an approach based on Integer Linear Programming, while Quan et al (2018) propose instead an approach based on the length of the sentences to create one-to-one matches. Other examples of alignement between parallel corpora can be found in (Santos, 2011).…”
Section: Related Workmentioning
confidence: 99%
“…None of the reviewed approaches is directly applicable to our setting. The works of Das andPetrov (2011), Moore (2002), Quan et al (2018), andZamani et al (2016) are all based on symmetric corpora, while imposing a 1-1 alignment is not possible in our domain. Bamman et al (2010), Bentivogli and Pianta (2005), Eger et al (2018), Fossum and Abney (2005), and Yarowsky et al (2001) all address word-level alignment instead of sentence level-alignment.…”
Section: Related Workmentioning
confidence: 99%