2014
DOI: 10.1007/978-3-319-10816-2_12
|View full text |Cite
|
Sign up to set email alerts
|

Feature Exploration for Authorship Attribution of Lithuanian Parliamentary Speeches

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
1
1
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…Finally, this approach could be explored across languages to demonstrate that its usefulness is not limited to an American context. There is already work that explores classifying authors by political affiliation in multipole languages and we would hope that here too reference to domain specific knowledge would be of use (Abd et al, 2020;Kapočiūtė-Dzikienė et al, 2014;Lapponi et al, 2018).…”
Section: Future Directionsmentioning
confidence: 99%
“…Finally, this approach could be explored across languages to demonstrate that its usefulness is not limited to an American context. There is already work that explores classifying authors by political affiliation in multipole languages and we would hope that here too reference to domain specific knowledge would be of use (Abd et al, 2020;Kapočiūtė-Dzikienė et al, 2014;Lapponi et al, 2018).…”
Section: Future Directionsmentioning
confidence: 99%
“…On the other hand, the number of prefixes is not large: in Lithuanian there are only thirty six prefixes [28], while modern English has fifty seven prefixes. Lithuanian is highly inflective, ambiguous (47 per cent of words are ambiguous), has rich vocabulary (0.5 million headwords) and has complex word derivation system (e.g., seventy eight suffixes for diminutives) [29]. Verbs have 3 conjugations, and are inflected by four tenses, three persons, two numbers, and three moods.…”
Section: Language Specific Featuresmentioning
confidence: 99%
“…There is a significant difference between frequency of unigrams in Lithuanian and in other (English, Polish, Serbian) languages (see Table I). Previous experiments in authorship identification using Lithuanian texts have demonstrated that content-features are more useful compared with function words or POS tags [29], while best results were obtained with word-level character tetra-grams and a set of lexical, morphological, and character features [30].…”
Section: Language Specific Featuresmentioning
confidence: 99%