Proceedings of the Conference Recent Advances in Natural Language Processing - Large Language Models for Natural Language Proce 2023
DOI: 10.26615/978-954-452-092-2_097

huPWKP: A Hungarian Text Simplification Corpus

Noémi Prótár,
Dávid Márk Nemeskey

Abstract: In this article we introduce huPWKP, the first parallel corpus of Hungarian sentence pairs, each consisting of a standard-language sentence and its simplified counterpart. It is the Hungarian translation of PWKP (Zhu et al., 2010), which we cleaned to improve its quality. We evaluated the corpus both with human evaluators and by training a seq2seq model on the Hungarian corpus and on the original (cleaned) English corpus. The Hungarian model performed slightly worse in terms of automatic metrics; however, the Engli…

References 22 publications