2019
DOI: 10.25126/jtiik.2019611226
|View full text |Cite
|
Sign up to set email alerts
|

Evaluasi Daftar Stopword Bahasa Indonesia

Abstract: <p class="Abstrak">Pada sistem temu kembali informasi berbentuk teks maupun <em>text mining</em>, terdapat proses pengindeksan. Teks diproses dengan tujuan mengintisarikan informasi berbentuk teks tersebut. Salah satu proses yang dilakukan adalah <em>stopword filtering</em>,<em> </em> beberapa kata yang tidak layak diindeks diabaikan berdasar sebuah daftar. Di dalam sistem berbahasa Indonesia, terdapat beberapa versi daftar <em>stopword</em> yang tersedia b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0
7

Year Published

2020
2020
2022
2022

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 3 publications
0
3
0
7
Order By: Relevance
“…At first, the stop words (meaningless words that frequently appear in a sentence) are removed for effectiveness [16]. As our dataset is mainly Indonesian, the stop words are the Indonesian ones as defined by Rahutomo and Ririd [17]. Secondly, the text is tokenized to form a sequence of words [18] to avoid trivial mismatches caused by meaningless characters.…”
Section: Methodsmentioning
confidence: 99%
“…At first, the stop words (meaningless words that frequently appear in a sentence) are removed for effectiveness [16]. As our dataset is mainly Indonesian, the stop words are the Indonesian ones as defined by Rahutomo and Ririd [17]. Secondly, the text is tokenized to form a sequence of words [18] to avoid trivial mismatches caused by meaningless characters.…”
Section: Methodsmentioning
confidence: 99%
“…Filter Stopword, merupakan proses menghilangkan kata-kata yang sering muncul namun tidak ada pengaruh apapun terhadap ekstraksi sentimen. Kata yang termasuk seperti kata penunjuk waktu, kata tanya [12]; 4. Filter Token (By Length), merupakan proses menghapus kata dengan jumlah huruf tertentu melalui dengan parameter min chars 4 dan max chars 25 untuk membatasi jumlah huruf pada kata minimal 4 dan maksimal 25 pada teks [13].…”
Section: B Text Processingunclassified
“…Kamus stopword tidak tersedia baku sehingga memerlukan database indeks berisi daftar kata-kata stop words (stopword list). Beberapa peneliti telah membuat stopword list bahasa Indonesia antara lain Fadillah Z. Tala, Damian Doyle, dan Yudi Wibisono [8].…”
Section: Pendahuluanunclassified