2015
DOI: 10.13053/cys-19-2-1550
|View full text |Cite
|
Sign up to set email alerts
|

Segmentation Strategies to Face Morphology Challenges in Brazilian-Portuguese/English Statistical Machine Translation and Its Integration in Cross-Language Information Retrieval

Abstract: Abstract.The use of morphology is particularly interesting in the context of statistical machine translation in order to reduce data sparseness and compensate any lack of training corpus. In this work, we propose several approaches to introduce morphology knowledge into a standard phrase-based machine translation system. We provide word segmentation using two different tools (COGROO and MORFESSOR) which allow to reduce the vocabulary and data sparseness. Then, we add to these segmentations the morphological in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2016
2016

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…The vector space model [ 4 ] is employed in many text processing problems, such as information retrieval [ 5 ], text categorization, authorship attribution, recognizing textual entailment [ 6 , 7 , 8 ], sentiment analysis [ 9 ], etc.…”
Section: State Of the Artmentioning
confidence: 99%
“…The vector space model [ 4 ] is employed in many text processing problems, such as information retrieval [ 5 ], text categorization, authorship attribution, recognizing textual entailment [ 6 , 7 , 8 ], sentiment analysis [ 9 ], etc.…”
Section: State Of the Artmentioning
confidence: 99%