Implementation of C4.5 Algorithm for Diarrhea Prediction

Authors

  • Sipra Barutu Program Studi Teknologi Informasi, Universitas Panca Budi
  • Siska Simamora Program Studi Teknologi Informasi, Universitas Putra Abadi Langkat

DOI:

https://doi.org/10.58471/ju-komi.v3i02.751

Keywords:

C4.5 Algorithm, Prediction, Diarrhea, Data Mining, Classification

Abstract

Diarrheal disease remains one of the major health problems among toddlers in Indonesia. Environmental factors such as drinking water quality, sanitation, mothers’ hand hygiene, and immunization status play an important role in influencing the occurrence of diarrhea. This study aims to analyze the application of the C4.5 algorithm in developing a predictive model for diarrhea among toddlers using secondary data from a Public Health Center (Puskesmas), consisting of 200 records divided into 150 training data and 50 testing data. The analysis process was carried out through entropy calculation, information gain assessment, and decision tree construction to obtain classification patterns. The results showed that the C4.5 model achieved an accuracy of 92%, precision of 87.5%, recall of 87.5%, F1-score of 87.5%, and specificity of 94.12%. These values indicate that the C4.5 algorithm is capable of making predictions with a good level of accuracy and balance in detecting both positive and negative cases. This study contributes to the utilization of data mining, particularly the C4.5 algorithm, as a decision-support tool in the health sector for the prevention of diarrheal disease among toddlers.

References

Kementerian Kesehatan Republik Indonesia. (2018). Riset Kesehatan Dasar 2018. Jakarta: Kemenkes RI.

Kementerian Kesehatan Republik Indonesia. (2020). Profil Kesehatan Indonesia 2020. Jakarta: Kemenkes RI.

Kementerian Kesehatan Republik Indonesia. (2022). Laporan Kesehatan Nasional 2022. Jakarta: Kemenkes RI.

Sepharni. (2022). Klasifikasi Penyakit Jantung Menggunakan Algoritma C4.5. Jurnal Informatika, 10(2), 75-92.

Depari, Widiastiwi, & Santoni. (2022). Perbandingan Algoritma Machine Learning dalam Klasifikasi Penyakit Jantung. Jurnal Kesehatan Digital, 15(3), 70-85.

Munggaran, & Hidayatulloh. (2015). Penerapan Algoritma C4.5 untuk Diagnosa Penyakit Diare Pada Anak Balita Berbasis Mobile. Jurnal Sistem Informasi, 8(1), 55-67.

Ente, et al. (2020). Klasifikasi Faktor-Faktor Penyebab Penyakit Diabetes Melitus Di Rumah Sakit Unhas Menggunakan Algoritma C4.5. Jurnal Ilmu Komputer, 12(2), 98-112.

Afifuddin, & Hakim. (2023). Deteksi Penyakit Diabetes Mellitus Menggunakan Algoritma Decision Tree Model Arsitektur C4.5. Jurnal Teknologi Informasi, 19(1), 33-45.

Prabowo, et al. (2023). Komparasi Tingkat Akurasi Random Forest dan Decision Tree C4.5 Pada Klasifikasi Data Penyakit Infertilitas. Jurnal Kesehatan Digital, 17(4), 88-102.

Kalimah. (2022). Klasifikasi Penyakit Diabetes Menggunakan Metode Decision Tree dan Random Forest. Jurnal Informatika Medis, 14(3), 67-79.

Masriadi. (2017). Epidemiologi Penyakit Diare. Makassar: Universitas Hasanuddin Press.

Purnama. (2016). Penyakit Diare dan Faktor Risikonya. Jakarta: Pustaka Kesehatan.

Simatupang. (2004). Rotavirus dan Perannya dalam Diare pada Anak. Jurnal Kedokteran Indonesia, 10(2), 44-55.

Nikma Kumala Sari, & Almansyah Lukito. (2017). Faktor Penyebab dan Pencegahan Diare pada Balita. Jurnal Kesehatan Masyarakat, 12(1), 78-91.

Hassan, & Alatas. (1985). Patogenesis dan Pencegahan Diare pada Anak. Jakarta: Balai Pustaka.

Kliegman, Marcdante, & Jenson. (2006). Nelson Textbook of Pediatrics. Philadelphia: Elsevier.

Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5-32.

Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann.

Witten, I. H., Frank, E., & Hall, M. A. (2011). Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann.

Downloads

Published

2025-04-18

How to Cite

Sipra Barutu, & Siska Simamora. (2025). Implementation of C4.5 Algorithm for Diarrhea Prediction. Jurnal Komputer Indonesia (Ju-Komi), 3(02), 53–60. https://doi.org/10.58471/ju-komi.v3i02.751