2013
DOI: 10.1007/s11434-013-5711-8

Language clustering with word co-occurrence networks based on parallel texts

Abstract: This study investigates the feasibility of applying complex networks to fine-grained language classification and of employing word co-occurrence networks based on parallel texts as a substitute for syntactic dependency networks in complex-network-based language classification. 14 word co-occurrence networks were constructed based on parallel texts of 12 Slavic languages and 2 non-Slavic languages, respectively. With appropriate combinations of major parameters of these networks, cluster analysis was able to di…
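As a rough illustration of the kind of pipeline the abstract describes, the sketch below builds a word co-occurrence network from raw text and computes two network parameters. It is not the authors' implementation: the abstract does not say which parameters were used, so average degree and average clustering coefficient are assumptions, as are the adjacency-based linking rule, the naive sentence splitting on periods, and the function names `cooccurrence_network` and `network_parameters`.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_network(text):
    """Build an undirected network linking word types that occur
    next to each other within a sentence (an assumed linking rule)."""
    adj = defaultdict(set)
    for sentence in text.lower().split("."):  # naive sentence split
        words = sentence.split()
        for a, b in zip(words, words[1:]):
            if a != b:  # skip self-loops from repeated words
                adj[a].add(b)
                adj[b].add(a)
    return adj


def network_parameters(adj):
    """Return node count, average degree, and average local
    clustering coefficient (illustrative choice of parameters)."""
    n = len(adj)
    if n == 0:
        return {"nodes": 0, "avg_degree": 0.0, "clustering": 0.0}
    avg_degree = sum(len(nb) for nb in adj.values()) / n
    total = 0.0
    for nb in adj.values():
        k = len(nb)
        if k < 2:
            continue  # clustering is taken as 0 for degree < 2
        # count links among the neighbours of this node
        links = sum(1 for u, v in combinations(nb, 2) if v in adj[u])
        total += 2 * links / (k * (k - 1))
    return {"nodes": n, "avg_degree": avg_degree, "clustering": total / n}


params = network_parameters(cooccurrence_network("a b c. c a."))
```

Computing one such parameter vector per language from the same parallel text would give the input for a cluster analysis; that step is left out here.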

Citations: cited by 85 publications (70 citation statements)
References: 20 publications
“…Furthermore, in [10] they introduced the node selectivity measure that can distinguish the difference between normal and randomised text. Liu and Cong [14] constructed co-occurrence networks from text in different languages and used complex network parameters for the classification (hierarchical clustering) of 14 languages, where Croatian was amongst 12 Slavic. Different applications of linguistic network analysis in NLP includes: evaluation of language complexity [15], automatic summarisation [16] and evaluation of machine translation [17], authorship attribution [18] and text quality analysis [19].…”
Section: Related Work (mentioning)
confidence: 99%
“…Various types of linguistic networks have already been studied: syntax networks [1,2], semantic networks [3], phonological networks [4], syllable networks [5,6], word co-occurrence networks [7][8][9][10][11][12][13][14][15][16][17][18][19]. In [3,20,21] a systematic methodological overview of linguistic complex networks principles is presented.…”
Section: Introduction (mentioning)
confidence: 99%
“…So far, much research has been carried out, mainly concerned with the structure of syntactic dependency networks (Ferrer i Cancho 2005; Liu 2008; Chen and Liu 2011; Čech et al 2011), the patterns in syntactic dependency networks (Ferrer i Cancho et al 2004;, language development or language evolution (Ke and Yao 2008; Mukherjee et al 2013; Mehler et al 2011), language clustering and linguistic categorization (Liu 2010; Liu and Cong 2013; Gong et al 2012; Abramov and Mehler 2011), manual and machine translation (Amancio et al 2008; Amancio et al 2011), word sense disambiguation (Christiano and Raphael 2013), communication and interaction (Banisch et al 2010; Mehler et al 2010), the structure of semantic networks (Borge-Holthoefer and Arenas 2010; Liu 2009), phonetics (Arbesman et al 2010; Yu et al 2011), morphology (Čech and Mačutek 2009; Liu and Xu 2011), parts of speech (Ferrer i Cancho et al 2007), Knowledge Networks (Allee 2007), cognitive networks (Mehler et al 2012).…”
Section: Introductionmentioning
confidence: 99%
“…Theoretical framework and research question. With the development of computer technology, complex network theory has been introduced into linguistic research (Ferrer i Cancho 2005; Liu 2008; Chen and Liu 2011; Liu and Cong 2013), which makes it possible to represent the linguistic system as a complex network. This complex network consists of nodes and links, and is used to carry out a precise quantitative analysis of the characteristics of a given linguistic system.…”
unclassified