2021
DOI: 10.17576/gema-2021-2102-01
|View full text |Cite
|
Sign up to set email alerts
|

Domain-specific Stop Words in Malaysian Parliamentary Debates 1959 – 2018

Abstract: Removal of stop words is essential in Natural Language Processing and text-related analysis. Existing works on Malay stop words are based on standard Malay and Quranic/Arabic translations into Malay. Thus, there is a lack of domain-specific stop word list, making it discordant for processing of Malay parliamentary discourse. In this paper, we propose a semantic approach towards identifying and removing Malay, conventional Malay spelling and English functional words in analysing a time-series corpus, namely the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 13 publications
0
1
0
Order By: Relevance
“…Removal of stop words is essential in Natural Language Processing (NLP) tasks and text analysis [ 2 ]. This process can be followed routine using a pre-defined library or using a list of stop words.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…Removal of stop words is essential in Natural Language Processing (NLP) tasks and text analysis [ 2 ]. This process can be followed routine using a pre-defined library or using a list of stop words.…”
Section: Proposed Methodologymentioning
confidence: 99%