Diabetes Risk Prediction using Logistic Regression Algorithm
DOI:
https://doi.org/10.55123/jomlai.v1i2.598Keywords:
Diabetes, Logistic Regression, Recall, Confusion MatrixAbstract
Many factors affect people suffering from diabetes, some of which are high blood pressure, excess sugar levels, weight, genetic history of diabetes, age, number of pregnancies, skin fold thickness, and the amount of insulin levels in the body. Logistic regression is a statistical tool that can be used in classification modeling about the presence or absence of diabetes. The aim of this study is to predict diagnostically whether a patient has diabetes or not. The results obtained are relatively low predictions because the ranges of values of several factors that cause it are very far apart so normalization is carried out so that the ranges of values are close together. The result is that diabetes risk prediction using a logistic regression algorithm with normalization resulted in a recall of 55% while without normalization it was 43%. Thus, normalization can improve the performance of diabetes risk prediction using a logistic regression algorithm. This model is expected to be a reference for the treatment of diabetics for doctors in hospitals and in the community to find out how to maintain a lifestyle and how to avoid diabetes in terms of the variables that affect the occurrence of the disease.
References
Y. Safitri and I. K. A. Nurhayati, “Pengaruh Pemberian Sari Pati Bengkuang (Pachyrhizus Erosus) terhadap Kadar Glukosa Darah pada Penderita Diabetes Mellitus Tipe II Usia 40-50 Tahun di Kelurahan Bangkinang Wilayah Kerja Puskesmas Bangkinang Kota Tahun 2018,” J. Ners, vol. 3, no. 1, pp. 69–81, 2019.
F. Fatmawati, “Perbandingan Algoritma Klasifikasi Data Mining Model C4. 5 dan Naive Bayes untuk Prediksi Penyakit Diabetes,” Techno Nusa Mandiri J. Comput. Inf. Technol., vol. 13, no. 1, pp. 50–59, 2016.
U. I. Lestari, A. Y. Nadhiroh, and C. Novia, “Penerapan Metode K-Nearest Neighbor untuk Sistem Pendukung Keputusan Identifikasi Penyakit Diabetes Melitus,” JATISI (Jurnal Tek. Inform. dan Sist. Informasi), vol. 8, no. 4, pp. 2071–2082, 2021.
F. Nasution, A. Andilala, and A. A. Siregar, “Faktor Risiko Kejadian Diabetes Mellitus,” J. Ilmu Kesehat., vol. 9, no. 2, pp. 94–102, 2021.
R. R. Santoso, “Implementasi Metode Machine Learning menggunakan Algoritma Evolving Artificial Neural Network pada Kasus Prediksi Diagnosis Diabetes.” Universitas Pendidikan Indonesia, 2020.
A. Roihan, P. A. Sunarya, and A. S. Rafika, “Pemanfaatan Machine Learning dalam Berbagai Bidang,” IJCIT (Indonesian J. Comput. Inf. Technol., vol. 5, no. 1, pp. 75–82, 2020.
F. K. Lembang, “Analisis Faktor Resiko Penyebab Diabetes Mellitus di Kota Ambon menggunakan Model Regresi Logistik,” Stat. J. Theor. Stat. Its Appl., vol. 15, no. 2, pp. 65–71, 2015.
M. A. Suhendra, D. Ispriyanti, and S. Sudarno, “Ketepatan Klasifikasi Pemberian Kartu Keluarga Sejahtera di Kota Semarang menggunakan Metode Regresi Logistik Biner dan Metode Chaid,” J. Gaussian, vol. 9, no. 1, pp. 64–74, 2020.
M. I. Gunawan, D. Sugiarto, and I. Mardianto, “Peningkatan Kinerja Akurasi Prediksi Penyakit Diabetes Mellitus Menggunakan Metode Grid Seacrh pada Algoritma Logistic Regression,” JEPIN (Jurnal Edukasi dan Penelit. Inform., vol. 6, no. 3, pp. 280–284, 2020.
M. Marna, M. Saftari, P. Jana, and M. Maxrizal, “Analisis Regresi Logistik Biner untuk memprediksi Faktor Internal dan Eksternal terhadap Indeks Prestasi,” Delta J. Ilm. Pendidik. Mat., vol. 9, no. 1, pp. 47–56, 2021.
Uci Machine Learning, “Pima Indians Diabetes Database,” kaggle, 2016. [Online], Tersedia: https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database [Diakses: 22 April 2022].
N. A. Kurniawati and S. P. Rahayu, “Analisis Kadar CO, Titania, dan Suhu Terhadap Kelembaban Udara Menggunakan Preprocessing Data, Distribusi Normal Multivariat, Uji Bartlett, dan T2 Hotelling”.
Gde Agung Brahmana Suryanegara, Adiwijaya, and Mahendra Dwifebri Purbolaksono, “Peningkatan Hasil Klasifikasi pada Algoritma Random Forest untuk Deteksi Pasien Penderita Diabetes menggunakan Metode Normalisasi,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 5, no. 1, pp. 114–122, 2021, doi: 10.29207/resti.v5i1.2880.
scikit learn, “sklearn.linear_model.LogisticRegression — scikit-learn 1.1.1 documentation,” [Online], Tersedia: scikit learn.https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html#sklearn.linear_model.LogisticRegressi [Diakses: 08 Juni 2022].
K. S. Nugroho, “Confusion Matrix untuk Evaluasi Model pada Supervised Learning,” Confusion Matrix untuk Eval. Model pada Supervised Learn., 2019.
S. Gargate, “Evaluating your classification model,” 13 Desember 2019. [Online]. Tersedia: https://medium.com/swlh/evaluating-your-classification-model-cb49338abb96 [Diakses: 12 Mei 2022].
R. Arthana, “Mengenal Accuracy, Precision, Recall dan Specificity serta yang diprioritaskan dalam Machine Learning,” 05 April 2019, [Online]. Tersedia: https://rey1024.medium.com/mengenal-accuracy-precission-recall-dan-specificity-serta-yang-diprioritaskan-b79ff4d77de8 [Diakses: 12 Mei 2022].
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 Qatrunnada Refa Cahyani, Mochammad Januar Finandi, Jathu Rianti, Devi Lestari Arianti, Arya Dwi Pratama Putra

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Copyright (c) 2022 The authors. Published by Yayasan Literasi Indonesia
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License
The author(s) whose article is published in the JOMLAI journal attain the copyright for their article and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. By submitting the manuscript to JOMLAI, the author(s) agree with this policy. No special document approval is required.
The author(s) guarantee that:their article is original, written by the mentioned author(s),
- has never been published before,
- does not contain statements that violate the law, and
- does not violate the rights of others, is subject to copyright held exclusively by the author(s), and is free from the rights of third parties, and that the necessary written permission to quote from other sources has been obtained by the author(s).
The author(s) retain all rights to the published work, such as (but not limited to) the following rights:
- Copyright and other proprietary rights related to the article, such as patents,
- The right to use the substance of the article in its own future works, including lectures and books,
- The right to reproduce the article for its own purposes,
- The right to archive all versions of the article in any repository, and
- The right to enter into separate additional contractual arrangements for the non-exclusive distribution of published versions of the article (for example, posting them to institutional repositories or publishing them in a book), acknowledging its initial publication in this journal (JOMLAI: Journal of Machine Learning and Artificial Intelligence).
Suppose the article was prepared jointly by more than one author. Each author submitting the manuscript warrants that all co-authors have given their permission to agree to copyright and license notices (agreements) on their behalf and notify co-authors of the terms of this policy. JOMLAI will not be held responsible for anything that may arise because of the writer's internal dispute. JOMLAI will only communicate with correspondence authors.
Authors should also understand that their articles (and any additional files, including data sets, and analysis/computation data) will become publicly available once published. The license of published articles (and additional data) will be governed by a Creative Commons Attribution-ShareAlike 4.0 International License. JOMLAI allows users to copy, distribute, display and perform work under license. Users need to attribute the author(s) and JOMLAI to distribute works in journals and other publication media. Unless otherwise stated, the author(s) is a public entity as soon as the article is published



















