A COMPREHENSIVE ANALYSIS OF MACHINE LEARNING TECHNIQUES FOR INCESSANT PREDICTION OF DIABETES MELLITUS

Published 30 June 2020 •  vol 13  •  no 1  • 


Authors:

 

Shiva Shankar Reddy, Department of CSE, BPUT, Odisha, India
Nilambar Sethi, Department of CSE, GIET, Gunupur, Odisha, India
R. Rajender, Department of CSE, LENDI Institute of Engineering & Technology, Vizianagaram, A P, India

Abstract:

 

People in every country including Indians is severely affected by diabetes. Based on its importance diabetes researchers across the world are working towards thousands of different research goals. In this paper prediction problem of the diabetes patients’ early readmission is considered. The main aim of the work is to help the doctors and patients to predict the hospital readmission using a better performing machine learning algorithm. Given the diabetes dataset as input, different machine learning mechanisms are applied on Pima Indian Diabetes Dataset for comparison and obtaining the best performing algorithm. The Machine learning algorithms considered in this work are Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Adaptive Boosting (AB) and Gradient Boosting (GB). All these algorithms are applied on diabetes dataset from 130 hospitals from 1999-2008 provided by UCI machine learning repository. Fourty six attributes of this diabetes dataset are taken as input variables and hospital readmitted attribute is taken as output variable. All these algorithms are comprehensively analyzed based on the precision, recall and f1-score. This analysis could be used to select the suitable prediction algorithm. Then the task is to predict whether a diabetes patient is early readmitted with diabetes risks or not using the optimal algorithm. Finally the prediction could be used by doctors, patients and hospital authorities to monitor the diabetic patients.

Keywords:

 

Diabetes; Machine Learning; Logistic Regression (LR); Decision Tree (DT); Random Forest (RF); Adaptive Boosting (AB); Gradient Boosting (GB)

References:

 

[1] Axon, R Neal and Mark V Williams, “Hospital Readmission as an Accountability Measure”, JAMA, vol. 305, no. 5, (2011), pp. 504-05.
[2] Stone, Julie and Geoffrey J. Hoffman, “Medicare Hospital Readmissions: Issues, Policy Options and PPACA”, Congressional research service, (2010), pp. 1-37.
[3] Epstein, Arnold M., “Revisiting Readmissions -Changing the Incentives for Shared Accountability”, vol. 360, no. 14, (2009), pp. 1457-59, https://doi.org/10.1056/NEJMe0901006.
[4] Cook, Curtiss, Karen Seifert, Bryan Hull, Michael Hovan, Joseph Charles, Victoria Miller-Cage and Mary Boyle, “Inpatient to Outpatient Transfer of Diabetes Care: Planing for an Effective Hospital Discharge”, vol. 15, no. 3, (2009), pp. 263-69, https://doi.org/10.4158/ep.15.3.263.
[5] Draznin, Boris, Janice Gilden, Sherita H Golden and Silvio E Inzucchi, “Pathways to Quality Inpatient Management of Hyperglycemia and Diabetes: A Call to Action”, Diabetes Care, vol. 36, no. 7, (2013), pp. 1807-14.
[6] Dungan, Kathleen M., “The Effect of Diabetes on Hospital Readmissions”, SAGE Publications, (2012).
[7] Sisodia, Deepti and Dilip Singh Sisodia, “Prediction of Diabetes Using Classification Algorithms”, Procedia computer science, vol. 132, (2018), pp. 1578-85.
[8] Zou, Quan, Kaiyang Qu, Yamei Luo, Dehui Yin, Ying Ju, and Hua Tang, “Predicting Diabetes Mellitus with Machine Learning Techniques”, Frontiers in genetics, vol. 9, (2018).
[9] Kavakiotis, Ioannis, Olga Tsave, Athanasios Salifoglou, Nicos Maglaveras, Ioannis Vlahavas and Ioanna Chouvarda, “Machine Learning and Data Mining Methods in Diabetes Research”, Computational and structural biotechnology journal, vol. 15, (2017), pp. 104-16.
[10] Dutta, Suvajit, Bonthala CS Manideep, Syed Muzamil Basha, Ronnie D. Caytiles and N. Ch. S. N. Iyengar, “Classification of Diabetic Retinopathy Images by Using Deep Learning Models”, International Journal of Grid and Distributed Computing, vol. 11, no. 1, (2018), pp. 99-106.
[11] Yun, Jong-Seob and Jin Heon Kim, “A Study on Training Data Selection Method for EEG Emotion Analysis using Machine Learning Algorithm”, Environment, vol. 119, (2018).
[12] Kumar Dewangan, Amit and Pragati Agrawal, “Classification of Diabetes Mellitus Using Machine Learning Techniques”, International Journal of Engineering and Applied Sciences, vol. 2, no. 5, (2015).
[13] El Jerjawi, Nesreen Samer and Abu-Naser, Samy S., “Diabetes Prediction Using Artificial Neural Network”, International Journal of Advanced Science and Technology, vol. 121, (2019), pp. 54-64.
[14] Jayashree, J, J. Vijayashree and N. Ch. S. N. Iyengar, “Risk Level Prediction of Diabetic Retinopathy Using Adaptive Neural Fuzzy Inference System”, International Journal of Advanced Science and Technology, vol. 120, (2018), pp. 59-72.
[15] Mercaldo, Francesco, Vittoria Nardone and Antonella Santone, “Diabetes Mellitus Affected Patients Classification and Diagnosis through Machine Learning Techniques”, Procedia computer science, vol. 112, (2017), pp. 2519-28.
[16] Alnemer, Loai M., Lama Rajab and Ibrahim Aljarah, “Conformal Prediction Technique to Predict Breast Cancer Survivability”, International Journal of Advanced Science and Technology, vol. 96, (2016), pp. 1-10.
[17] Gaonkar, Abhijitsingh Putu, Ritwik Kulkarni, Ronnie D. Caytiles and N. Iyengar, “Classification of Lower Back Pain Disorder Using Multiple Machine Learning Techniques and Identifying Degree of Importance of Each Parameter”, International Journal of Advanced Science and Technology, vol. 105, (2017), pp. 11-24.
[18] Sharmila, K. and S Manickam, “Diagnosing Diabetic Dataset Using Hadoop and K-Means Clustering Techniques”, Indian Journal of Science and Technology, vol. 9, no. 40, (2016), pp. 1-5.
[19] Kumari, V Anuja and R Chitra, “Applications Classification of Diabetes Disease Using Support Vector Machine”, International Journal of Engineering Research, vol. 3, no. 2, (2013), pp. 1797-801.
[20] Bano, Saima and Khan, Naeem, “A Survey of Data Clustering Methods”, International Journal of Advanced Science and Technology, vol. 113, (2018), pp. 133-142.
[21] Kaur, Gaganjot and Amit Chhabra, “Improved J48 Classification Algorithm for the Prediction of Diabetes”, International Journal of Computer Applications, vol. 98, no. 22, (2014).
[22] Shankaracharya, Devang Odedra, Subir Samanta and Ambarish S Vidyarthi, “Computational Intelligence-Based Diagnosis Tool for the Detection of Prediabetes and Type 2 Diabetes in India”, The review of diabetic studies, vol. 9, no. 1, (2012), pp. 55. [23] Jordan, Michael I. and Robert A Jacobs, “Hierarchical Mixtures of Experts and the Em Algorithm”, Neural computation, vol. 6, no. 2, (1994), pp. 181-214.
[24] Anitha, Raju, S. Jyothi, Venkata Naresh Mandhala, Debnath Bhattacharyya and Tai-hoon Kim, “Deep Learning Image Processing Technique for Early Detection of Alzheimer’s Disease”, International Journal of Advanced Science and Technology, vol. 106, no. 1, (2017), pp. 85-104.
[25] Shin, MoonSun, HaYoung Lee, JinSoo Kim and JungHwan Kim, “Design of Intelligent Healthcare Support System Applying Machine Learning”, International Journal of Advanced Science and Technology, vol. 111, (2018), pp. 73-84.
[26] Mani, Subramani, Yukun Chen, Tom Elasy, Warren Clayton and Joshua Denny, “Type 2 Diabetes Risk Forecasting from Emr Data Using Machine Learning”, Paper presented at the AMIA annual symposium proceedings, (2012).
[27] Wang, DongYue and Whangbo, TaegKeun, “Automatic Diagnostic System for Parkinson’s Disease Based on Deep Learning using Midbrain Magnetic Resonance Images”, International Journal of Advanced Science and Technology, vol. 124, (2019), pp. 1-20.
[28] Barakat, Nahla, Andrew P Bradley and Mohamed Nabil H Barakat, “Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus”, IEEE transactions on information technology in biomedicine, vol. 14, no. 4, (2010), pp. 1114-20.
[29] Syed Muzamil Basha, “A Soft Computing Approach to Provide Recommendation on PIMA Diabetes”, International Journal of Advanced Science and Technology, vol. 105, no. 1, (2017), pp. 19-32.
[30] Patil, Ratna and Sharavari Tamane, “A Comparative Analysis on the Evaluation of Classification Algorithms in the Prediction of Diabetes”, International Journal of Electrical and Computer Engineering, vol. 8, no. 5, (2018), pp. 3966.
[31] Cox, David R, “The Regression Analysis of Binary Sequences”, Journal of the Royal Statistical Society: Series B, vol. 20, no. 2, (1958), pp. 215-32.
[32] Quinlan, J. Ross, “Induction of Decision Trees”, Machine learning, vol. 1, no. 1, (1986), pp. 81-106.
[33] Breiman, Leo, “Random Forests”, Machine learning, vol. 45, no. 1, (2001), pp. 5-32.
[34] Nai-arun, Nongyao and Rungruttikarn Moungmai, “Comparison of Classifiers for the Risk of Diabetes Prediction”, Procedia Computer Science, vol. 69, (2015), pp. 132-42.
[35] Schapire, Robert E, “The Strength of Weak Learnability”, Machine learning, vol. 5, no. 2, (1990), pp. 197-227.
[36] Kumar Das, Sujit, Arnab Kumar Mishra and Pinki Roy, “Automatic Diabetes Prediction Using Tree Based Ensemble Learners”, International Journal of Computational Intelligence & IoT, vol. 2, no. 2 (2019).

Citations:

 

APA:
Reddy, S. S., Sethi, N., & Rajender, R. (2020). A Comprehensive Analysis of Machine Learning Techniques for Incessant Prediction of Diabetes Mellitus. International Journal of Grid and Distributed Computing (IJGDC), ISSN: 2005-4262 (Print); 2207-6379 (Online), NADIA, 13(1), 1-22. doi: 10.33832/ijgdc.2020.13.1.01.

MLA:
Reddy, Shiva Shankar, et al. “A Comprehensive Analysis of Machine Learning Techniques for Incessant Prediction of Diabetes Mellitus.” International Journal of Grid and Distributed Computing (IJGDC), ISSN: 2005-4262 (Print); 2207-6379 (Online), NADIA, vol. 13, no. 1, 2020, pp. 1-22. IJGDC, http://article.nadiapub.com/IJGDC/vol13_no1/1.html.

IEEE:
[1] S. S. Reddy, N. Sethi, and R. Rajender, " A Comprehensive Analysis of Machine Learning Techniques for Incessant Prediction of Diabetes Mellitus." International Journal of Grid and Distributed Computing (IJGDC), ISSN: 2005-4262 (Print); 2207-6379 (Online), NADIA, vol. 13, no. 1, pp. 1-22, June 2020.