The spread of Lumpy Skin Disease (LSD) that infects livestock is increasingly widespread in various parts of the world. Early detection of the disease’s spread is necessary so that the economic losses caused by LSD are not higher. The use of machine learning algorithms to predict the presence of a disease has been carried out, including in the field of animal health. The study aims to predict the presence of LSD in an area by utilizing the LSD dataset obtained from Mendeley Data. The number of lumpy infected cases is so low that it creates imbalanced data, posing a challenge in training machine learning models. Handling the unbalanced data is performed by sampling technique using the Random Under-sampling technique and Synthetic Minority Oversampling Technique (SMOTE). The Random Forest classification model was trained on sample data to predict cases of lumpy infection. The Random Forest classifier performs very well on both under-sampling and oversampling data. Measurement of performance metrics shows that SMOTE has a superior score of 1-2% compared to the use of Random Undersampling. Furthermore, Re-call rate, which is the metric we want to maximize in identifying lumpy cases, is superior when using SMOTE and has slightly better precision than Random Undersampling. This research only focuses on how to balance unbalanced data classes so that the optimization of the model has not been implemented, which creates opportunities for further research in the future.
Ketepatan dalam mengekstrak dan meringkas ribuan ulasan ke dalam beberapa topik menjadi kunci dalam pelaksanaan pengolahan data dan informasi lebih lanjut. Tidak terkecuali dalam industri perhotelan yang mana suatu ulasan merupakan sebuah aset yang apabila diolah dapat menghasilkan suatu informasi yang nantinya akan digunakan untuk kepentingan ekspansi bisnis dan keberlangsungan usahanya. Penelitian pemodelan topik ulasan hotel ini menggunakan Latent Dirichlet Allocation sebagai sarana untuk peringkasan dokumennya. Latent Dirichlet Allocation terbukti efektif dalam pengolahan peringkasan kata-kata dan banyak penelitian yang menggunakan metode ini. Adapun tujuan dari penelitian yang dilakukan untuk mendapatkan ringkasan kata-kata yang membentuk suatu topik yang mewakili keseluruhan ulasan yang mana dapat menghasilkan suatu data bagi manajemen hotel dalam mempertahankan eksistensinya dalam bisnis tersebut serta melakukan ekspansi dengan mempertimbangkan hasil dari pemodelan topik tersebut. Dari hasil pemodelan topik Latent Dirichlet Allocation yang telah dilakukan terhadap dataset review Tripadvisor dapat disimpulkan bahwa tren ulasan lebih banyak membahas mengenai lokasi, pelayanan, hotel, sarapan, resort dan pantai.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.