2017
DOI: 10.1007/978-3-319-54241-6_14
|View full text |Cite
|
Sign up to set email alerts
|

Characterization of Written Languages Using Structural Features from Common Corpora

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…Rather than using techniques resulting in word embeddings that abstract words into non-interpretable vectors, we retain the nature of the word itself and instead use tools from network science to analyze the word co-occurrence structure in the corpus. Although NLP has leaned more towards probabilistic models and neural networks, network science has nevertheless found a wide range of applications in computational linguistics (Ferrer i Cancho et al 2004;Liu and Cong 2013;Cong and Liu 2014;Al Rozz et al 2017;Wang et al 2017;Chen et al 2018;Jiang et al 2019). In fact, some techniques commonly used in NLP, such as latent Dirichlet allocation, have been shown to parallel network science techniques (Gerlach et al 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Rather than using techniques resulting in word embeddings that abstract words into non-interpretable vectors, we retain the nature of the word itself and instead use tools from network science to analyze the word co-occurrence structure in the corpus. Although NLP has leaned more towards probabilistic models and neural networks, network science has nevertheless found a wide range of applications in computational linguistics (Ferrer i Cancho et al 2004;Liu and Cong 2013;Cong and Liu 2014;Al Rozz et al 2017;Wang et al 2017;Chen et al 2018;Jiang et al 2019). In fact, some techniques commonly used in NLP, such as latent Dirichlet allocation, have been shown to parallel network science techniques (Gerlach et al 2018).…”
Section: Introductionmentioning
confidence: 99%