2008
DOI: 10.1007/978-3-540-85760-0_115
|View full text |Cite
|
Sign up to set email alerts
|

ParaMor: Finding Paradigms across Morphology

Abstract: Our algorithm, ParaMor, fared well in Morpho Challenge 2007 (Kurimo et al., 2007), a peer operated competition pitting against one another algorithms designed to discover the morphological structure of natural languages from nothing more than raw text. ParaMor constructs sets of affixes closely mimicking the paradigms of a language, and, with these structures in hand, annotates word forms with morpheme boundaries. Of the four language tracks in Morpho Challenge 2007, we entered ParaMor in English and German. M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2008
2008
2015
2015

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 17 publications
(22 citation statements)
references
References 9 publications
0
22
0
Order By: Relevance
“…Such languages have a complex morphology, which cannot be covered by hand-made lexical resources. Some studies aim at solving this problem by learning inflectional paradigms from raw text corpora by clustering word forms in the corpus and analyzing the resulting clusters ( [9,8,3]). Other unsupervised methods applied to morphology induction are that of [15], [6] and [5], the latter using morphemes to encode a corpus by grouping morphemes into structures, called signatures, representing inflectional paradigms.…”
Section: Related Workmentioning
confidence: 99%
“…Such languages have a complex morphology, which cannot be covered by hand-made lexical resources. Some studies aim at solving this problem by learning inflectional paradigms from raw text corpora by clustering word forms in the corpus and analyzing the resulting clusters ( [9,8,3]). Other unsupervised methods applied to morphology induction are that of [15], [6] and [5], the latter using morphemes to encode a corpus by grouping morphemes into structures, called signatures, representing inflectional paradigms.…”
Section: Related Workmentioning
confidence: 99%
“…The baseline ParaMor algorithm which we extend here competed in the English and German tracks of Morpho Challenge 2007 (Monson et al, 2007b). The peer operated competitions of the Morpho Challenge series standardize the evaluation of unsupervised morphology induction algorithms (Kurimo et al, 2007a;.…”
Section: Unsupervised Morphology Inductionmentioning
confidence: 99%
“…Although Goldsmith (2001) and Goldsmith and Hu (2004) discuss ideas for segmenting individual words into more than two morphemes, the implemented Linguistica algorithm, as presented in Goldsmith (2006), permits at most a single morpheme boundary in each word. Second, ParaMor decouples the task of paradigm identification from that of word segmentation (Monson et al, 2007b). In contrast, morphology models in Linguistica inherently encode both a belief about paradigm structure on individual words as well as a segmentation of those words.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, there are 7 new papers on this topic by Bernhard (2007), Bordag (2007), Chan (2007), McNamee (2007), Monson et al (2007), Pitler and Keshava (2007), and Tepper (2007). The approach described in this paper is a direct extension of Zeman (2007) and we will frequently refer to him.…”
Section: Introductionmentioning
confidence: 99%