2020
DOI: 10.17576/apjitm-2020-0902-01
|View full text |Cite
|
Sign up to set email alerts
|

Quantifying Semantic Shift Visually on a Malay Domain Specific Corpus Using Temporal Word Embedding Approach

Abstract: In this study, we propose an alternative approach to analyzing a domain-specific time series corpus for detecting word evolution. The method trains a target corpus in time series into a temporal word embedding (TWE) model. The advantage of TWE is that one can see how the meaning of a word changes over time. We have chosen the TWEC approach to model a Malay domain-specific time-series corpus, the Malaysian Hansard Corpus (MHC), to a TWE model and called the model as MHC-TWEC. Two primary analyses, i.e., self-si… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Even though the data used is from parliament, it is also relevant to general use of the Malay language. In addition, many previous studies have utilised the MHC (Nor Fariza, Anis Nadiah, Azhar, Imran & Sabrina, 2019;Norsimah, Azhar, Anis Nadiah & Imran, 2019;Sabrina, Nor Fariza, Azhar & Anis Nadiah, 2020;Sabrina, Saidah et al, 2020). It is expected that the production of Malay stop words relating to the corpus will assist future research in terms of stop word removal and text processing in general.…”
Section: Introductionmentioning
confidence: 99%
“…Even though the data used is from parliament, it is also relevant to general use of the Malay language. In addition, many previous studies have utilised the MHC (Nor Fariza, Anis Nadiah, Azhar, Imran & Sabrina, 2019;Norsimah, Azhar, Anis Nadiah & Imran, 2019;Sabrina, Nor Fariza, Azhar & Anis Nadiah, 2020;Sabrina, Saidah et al, 2020). It is expected that the production of Malay stop words relating to the corpus will assist future research in terms of stop word removal and text processing in general.…”
Section: Introductionmentioning
confidence: 99%