2021
DOI: 10.5937/spsunp2101017m
|View full text |Cite
|
Sign up to set email alerts
|

Creating a stop word dictionary in Serbian

Abstract: By using natural language processing techniques, it is possible to get a lot of information from the extraction of document topics through mapping of document key words or content-based classification of documents, etc. To get this information, an important step is to separate words that carries informative value in a sentence from those words that do not affect its meaning. By using dictionaries of stop words specific to each natural language, the marking of words that do not carry meaning in the sentence is … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 72 publications
0
2
0
Order By: Relevance
“…We removed the URLs, mentions, etc, using regular expressions. We used the list of stop words described by Marovac et al [ 61 ], which we extended with all the alternative names for COVID-19 and derivatives of the word “vaccine.” These terms naturally appear in most tweets since we applied them as our Twitter search keywords.…”
Section: Methodsmentioning
confidence: 99%
“…We removed the URLs, mentions, etc, using regular expressions. We used the list of stop words described by Marovac et al [ 61 ], which we extended with all the alternative names for COVID-19 and derivatives of the word “vaccine.” These terms naturally appear in most tweets since we applied them as our Twitter search keywords.…”
Section: Methodsmentioning
confidence: 99%
“…The file contains two columns: word and label. The label describes the type of words: auxiliary verbs (V), pronouns (PRON), adverbs (ADV), prepositions (PREP), conjunctions (CONJ), exclamations (EXCL), particles (PART) and abbreviations (ABBR) (Marovac et al, 2021).…”
Section: Dictionaries and Terminologiesmentioning
confidence: 99%