Subword Segmentation for Machine Translation Based on Grouping Words by Potential Roots

Zuters, Janis; Strazds, Gus

doi:10.22364/bjmc.2019.7.4.04

Search citation statements

Order By: Relevance

Paper Sections

Select...

Methods1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2021

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For segmentation of Latvian text, we have applied a GenSeg tool, described in (Zuters and Strazds, 2019) to preprocess the dialog file, so that the input now consisted of the messages in an already segmented form, leaving the rest of the process exactly as before -so that the run of the model on segmented versus unsegmented data differed only in the input file. Having fixed the metaparameters at 64 hidden units and vocabulary size 100, we found that subword segmentation improved the resulting model accuracy by 1.25% (z-score = -4.02).…”

Section: Methodsmentioning

confidence: 99%