2023
DOI: 10.1111/ijal.12485
|View full text |Cite
|
Sign up to set email alerts
|

In the melting pot of web‐crawled texts: The challenges of extracting English words from Croatian corpora

Abstract: The focus of this paper are English words and phrases used in Croatian which, unlike loanwords, have not undergone major adaptations at the orthographic, phonetic, or other levels apart from being influenced by the inflectional system of the recipient language. A list of English words in Croatian corpora was compiled using automatic algorithm extraction, corpus query language in Sketch Engine, and manual word list evaluation with the end goal of publishing the first comprehensive online database of English wor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 33 publications
0
0
0
Order By: Relevance