2018
DOI: 10.13053/cys-22-4-3104
|View full text |Cite
|
Sign up to set email alerts
|

Dialectones: Finding Statistically Significant Dialectal Boundaries Using Twitter Data

Abstract: Most NLP applications assume that a particular language is homogeneous in the regions where it is spoken. However, each language varies considerably throughout its geographical distribution. To make NLP sensitive to dialects, a reliable, representative and up-to-date source of information that quantitatively represents such geographical variation is necessary. However, some of the current approaches have disadvantages such as the need for parameters, the disregard of the geographical coordinates in the analysi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…For instance, Spanish is a largely used language; nevertheless, it is used differently according to the country or even a more specific geographical location. Hence, the language could analyze at the regional level (Huang et al, 2016;Rodriguez-Diaz et al, 2018). In Rodriguez-Diaz et al (2018), the authors study Spanish language variations in Colombia.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, Spanish is a largely used language; nevertheless, it is used differently according to the country or even a more specific geographical location. Hence, the language could analyze at the regional level (Huang et al, 2016;Rodriguez-Diaz et al, 2018). In Rodriguez-Diaz et al (2018), the authors study Spanish language variations in Colombia.…”
Section: Introductionmentioning
confidence: 99%
“…Hence, the language could analyze at the regional level (Huang et al, 2016;Rodriguez-Diaz et al, 2018). In Rodriguez-Diaz et al (2018), the authors study Spanish language variations in Colombia. The analysis used unigram features, and the authors stated that it was challenging to compare Spanish variations against regions identified by other authors using classical dialectometry.…”
Section: Introductionmentioning
confidence: 99%