Proceedings of the First Celtic Language Technology Workshop 2014
DOI: 10.3115/v1/w14-4607

Irish National Morphology Database: a high-accuracy open-source dataset of Irish words

Abstract: The Irish National Morphology Database is a human-verified, Official Standard-compliant dataset containing the inflected forms and other morpho-syntactic properties of Irish nouns, adjectives, verbs and prepositions. It is being developed by Foras na Gaeilge as part of the New English-Irish Dictionary project. This paper introduces this dataset and its accompanying software library Gramadán.

Cited by 1 publication
(1 citation statement)
“…In the minimal format, only the base form is stored explicitly in the entry. In the expanded format, in addition to the base form, male/wife/daughter forms and inflected forms are generated, added, and marked up, for example:

<surname-irish>
  <form gender="male" case="nom"><pre>Ó</pre> Briain</form>
  <form gender="male" case="gen"><pre>Uí</pre> B<mut>h</mut>riain</form>
  <form gender="male" case="voc"><pre>Uí</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="wife" case="nom"><pre>Uí</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="wife" case="gen"><pre>Uí</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="wife" case="voc"><pre>Uí</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="daughter" case="nom"><pre>Ní</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="daughter" case="gen"><pre>Ní</pre> B<mut>h</mut>riain</form>
  <form gender="female" familyStatus="daughter" case="voc"><pre>Ní</pre> B<mut>h</mut>riain</form>
</surname-irish>

This method of providing minimal and expanded formats is inspired by a similar distinction made in the Irish National Morphology Database (Měchura 2014). The algorithm which converts from minimal to expanded format exists in two implementations, once as an XSL stylesheet (available publicly for download) and once as a function in the C# programming language (used internally by our web application).…”
Section: Methods
Citation type: mentioning
confidence: 99%
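
The minimal-to-expanded conversion described in the citation statement can be illustrated with a short Python sketch. This is not the cited authors' XSL stylesheet or C# function, neither of which is shown here. It is a hypothetical reconstruction that reproduces only the quoted "Ó Briain" pattern. The names `expand_surname`, `lenite_markup` and `PATTERN` are invented for illustration, and the lenition rule is deliberately naive: it assumes an "Ó" surname whose stem begins with a consonant, and it ignores exceptions and other surname types such as "Mac" surnames or vowel-initial stems.

```python
# Hypothetical sketch: expand a minimal "Ó <stem>" surname entry into the
# nine marked-up forms shown in the quoted example. Not the authors' code.

# Consonants treated as lenitable here (a simplifying assumption).
LENITABLE = set("bcdfgmpstBCDFGMPST")

def lenite_markup(stem: str) -> str:
    """Insert <mut>h</mut> after a lenitable initial consonant."""
    if stem and stem[0] in LENITABLE and stem[1:2].lower() != "h":
        return f"{stem[0]}<mut>h</mut>{stem[1:]}"
    return stem

# (gender, familyStatus or None, case, prefix, lenite the stem?)
# The rows mirror the quoted example one to one.
PATTERN = [
    ("male", None, "nom", "Ó", False),
    ("male", None, "gen", "Uí", True),
    ("male", None, "voc", "Uí", True),
    ("female", "wife", "nom", "Uí", True),
    ("female", "wife", "gen", "Uí", True),
    ("female", "wife", "voc", "Uí", True),
    ("female", "daughter", "nom", "Ní", True),
    ("female", "daughter", "gen", "Ní", True),
    ("female", "daughter", "voc", "Ní", True),
]

def expand_surname(base: str) -> str:
    """Build the expanded <surname-irish> markup from a base form."""
    prefix, _, stem = base.partition(" ")
    if prefix != "Ó" or not stem:
        raise ValueError("this sketch only handles surnames of the form 'Ó <stem>'")
    lines = ["<surname-irish>"]
    for gender, status, case, pre, mutate in PATTERN:
        attrs = f'gender="{gender}"'
        if status is not None:
            attrs += f' familyStatus="{status}"'
        attrs += f' case="{case}"'
        body = lenite_markup(stem) if mutate else stem
        lines.append(f"  <form {attrs}><pre>{pre}</pre> {body}</form>")
    lines.append("</surname-irish>")
    return "\n".join(lines)

if __name__ == "__main__":
    print(expand_surname("Ó Briain"))
```

Keeping the paradigm in a data table rather than in branching code means that only the base form needs to be stored, while the inflected forms are regenerated on demand, which is the point of the minimal/expanded distinction.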