Implementation of Random Forest Algorithm for Diarrhea Prediction
DOI:
https://doi.org/10.58471/ju-komi.v3i02.753Keywords:
Random Forest, Prediction, Diarrhea, Data Mining, ClassificationAbstract
Diarrhea is one of the leading causes of morbidity among toddlers in Indonesia. Environmental factors such as drinking water quality, sanitation, maternal hand hygiene, and immunization status contribute significantly to the incidence of diarrhea. This study aims to analyze the application of the Random Forest algorithm in developing a predictive model for diarrhea in toddlers using secondary data from a community health center (Puskesmas), consisting of 200 records divided into 150 training data and 50 testing data. The model was constructed by generating multiple decision trees and combining them using a majority voting technique. The results show that the Random Forest algorithm achieved an accuracy of 88%, precision of 77.78%, recall of 87.5%, F1-score of 82.35%, and specificity of 88.24%. These values indicate that Random Forest is quite reliable in detecting positive diarrhea cases, although some limitations remain in reducing misclassification of negative data. This study contributes to the utilization of machine learning algorithms, particularly Random Forest, as a decision-support tool in the health sector for diarrhea prevention among toddlers.
References
Kementerian Kesehatan Republik Indonesia. (2018). Riset Kesehatan Dasar 2018. Jakarta: Kemenkes RI.
Kementerian Kesehatan Republik Indonesia. (2020). Profil Kesehatan Indonesia 2020. Jakarta: Kemenkes RI.
Kementerian Kesehatan Republik Indonesia. (2022). Laporan Kesehatan Nasional 2022. Jakarta: Kemenkes RI.
Sepharni. (2022). Klasifikasi Penyakit Jantung Menggunakan Algoritma C4.5. Jurnal Informatika, 10(2), 75-92.
Depari, Widiastiwi, & Santoni. (2022). Perbandingan Algoritma Machine Learning dalam Klasifikasi Penyakit Jantung. Jurnal Kesehatan Digital, 15(3), 70-85.
Munggaran, & Hidayatulloh. (2015). Penerapan Algoritma C4.5 untuk Diagnosa Penyakit Diare Pada Anak Balita Berbasis Mobile. Jurnal Sistem Informasi, 8(1), 55-67.
Ente, et al. (2020). Klasifikasi Faktor-Faktor Penyebab Penyakit Diabetes Melitus Di Rumah Sakit Unhas Menggunakan Algoritma C4.5. Jurnal Ilmu Komputer, 12(2), 98-112.
Afifuddin, & Hakim. (2023). Deteksi Penyakit Diabetes Mellitus Menggunakan Algoritma Decision Tree Model Arsitektur C4.5. Jurnal Teknologi Informasi, 19(1), 33-45.
Prabowo, et al. (2023). Komparasi Tingkat Akurasi Random Forest dan Decision Tree C4.5 Pada Klasifikasi Data Penyakit Infertilitas. Jurnal Kesehatan Digital, 17(4), 88-102.
Kalimah. (2022). Klasifikasi Penyakit Diabetes Menggunakan Metode Decision Tree dan Random Forest. Jurnal Informatika Medis, 14(3), 67-79.
Masriadi. (2017). Epidemiologi Penyakit Diare. Makassar: Universitas Hasanuddin Press.
Purnama. (2016). Penyakit Diare dan Faktor Risikonya. Jakarta: Pustaka Kesehatan.
Simatupang. (2004). Rotavirus dan Perannya dalam Diare pada Anak. Jurnal Kedokteran Indonesia, 10(2), 44-55.
Nikma Kumala Sari, & Almansyah Lukito. (2017). Faktor Penyebab dan Pencegahan Diare pada Balita. Jurnal Kesehatan Masyarakat, 12(1), 78-91.
Hassan, & Alatas. (1985). Patogenesis dan Pencegahan Diare pada Anak. Jakarta: Balai Pustaka.
Kliegman, Marcdante, & Jenson. (2006). Nelson Textbook of Pediatrics. Philadelphia: Elsevier.
Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5-32.
Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann.
Witten, I. H., Frank, E., & Hall, M. A. (2011). Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann.








