Early Diseases Diagnosis of Chropnic via Machine Learning Based Models in Big Data Health Records

Authors

  • Venkata Teja Nagumotu Sr Network Engineer, Techno-bytes Inc . Author
  • Harsha Vardhan Reddy Kavuluri Lead database administrator, Wissen infotech Inc. Author
  • Akhil Kumar Pathani Network Engineer, Ebay. Author
  • Ajay Dasari Senior Support Engineer, Microsoft. Author
  • Venkata Kishore Chilakapati Support Escalation Engineer, Microsoft. Author
  • Srikanth Reddy Keshireddy Senior Software Engineer, .Keen Info Tek Inc. Author

DOI:

https://doi.org/10.63282/3050-922X.IJERET-V5I2P114

Keywords:

Chronic Kidney Disease Detection, Early Detection, Deep Neural Network, Chronic Kidney Disease (CKD) Dataset, Predictive Analytics, Health Informatics

Abstract

Today's people suffer from a wide variety of diseases due to various influences and choices made at the community level. Thus, to prevent the occurrence of such illnesses, persistent identification and prediction are paramount. Manually determining the disorders is generally challenging for doctors to be accurate with the exact numbers. Using massive data extracted from EHRs, this research lays forth an effective machine learning (ML) approach for CKD early diagnosis. Data preparation steps (including outlier removal, missing value replacement and transforming categorical data) are done before using normalization and RFE to find the best features. ETC is used as the main classification model because it helps to improve prediction and reduces the chances of overfitting by splitting the data randomly. With an accuracy (ACC) of 99.5%, the model is very effective in diagnosing CKD. When evaluation measures include precision (PRE), recall (REC), F1-score (F1), and AUC-ROC, it shows that the approach performs well. They prove that using machine learning and big data together can enhance how early diagnosis and decisions are made in chronic disease cases.

References

[1] A. Balasubramanian, “Intelligent Health Monitoring: Leveraging Machine Learning and Wearables for Chronic Disease Management and Prevention,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 6, pp. 1–13, 2019, doi: 10.5281/zenodo.14535443.

[2] L. Segall, I. Nistor, and A. Covic, “Heart failure in patients with chronic kidney disease: A systematic integrative review,” Biomed Res. Int., vol. 2014, 2014, doi: 10.1155/2014/937398.

[3] R. Reynolds et al., “A systematic review of chronic disease management interventions in primary care,” BMC Fam. Pract., vol. 19, no. 1, p. 11, Dec. 2018, doi: 10.1186/s12875-017-0692-3.

[4] W. Raghupathi and V. Raghupathi, “An Empirical Study of Chronic Diseases in the United States: A Visual Analytics Approach to Public Health,” Int. J. Environ. Res. Public Health, vol. 15, no. 3, p. 431, Mar. 2018, doi: 10.3390/ijerph15030431.

[5] N. Bhardwaj, B. Wodajo, A. Spano, S. Neal, and A. Coustasse, “The Impact of Big Data on Chronic Disease Management,” Health Care Manag. (Frederick)., vol. 37, no. 1, pp. 90–98, Jan. 2018, doi: 10.1097/HCM.0000000000000194.

[6] C. S. Kruse, R. Goswamy, Y. Raval, and S. Marawi, “Challenges and Opportunities of Big Data in Health Care: A Systematic Review,” JMIR Med. Informatics, vol. 4, no. 4, p. e38, Nov. 2016, doi: 10.2196/medinform.5359.

[7] S. Achouche, U. B. Yalamanchi, and N. Raveendran, “Method, apparatus, and computer-readable medium for performing a data exchange on a data exchange framework,” 2019

[8] R. Miotto, F. Wang, S. Wang, X. Jiang, and J. T. Dudley, “Deep learning for healthcare: review, opportunities and challenges,” Brief. Bioinform., vol. 19, no. 6, pp. 1236–1246, Nov. 2018, doi: 10.1093/bib/bbx044.

[9] M. Elhoseny, K. Shankar, and J. Uthayakumar, “Intelligent Diagnostic Prediction and Classification System for Chronic Kidney Disease,” Sci. Rep., vol. 9, no. 1, p. 9583, Jul. 2019, doi: 10.1038/s41598-019-46074-2.

[10] M. Almasoud and T. E, “Detection of Chronic Kidney Disease using Machine Learning Algorithms with Least Number of Predictors,” Int. J. Adv. Comput. Sci. Appl., vol. 10, no. 8, 2013, doi: 10.14569/IJACSA.2019.0100813.

[11] Y. Amirgaliyev, S. Shamiluulu, and A. Serek, “Analysis of Chronic Kidney Disease Dataset by Applying Machine Learning Methods,” in 2018 IEEE 12th International Conference on Application of Information and Communication Technologies (AICT), 2018, pp. 1–4. doi: 10.1109/ICAICT.2018.8747140.

[12] V. S, S. S, V. H, and S. S, “Disease Prediction Using Machine Learning Over Big Data,” Comput. Sci. Eng. An Int. J., vol. 8, no. 1, pp. 01–08, Feb. 2018, doi: 10.5121/cseij.2018.8101.

[13] S. B. Akben, “Early Stage Chronic Kidney Disease Diagnosis by Applying Data Mining Methods to Urinalysis, Blood Analysis and Disease History,” IRBM, vol. 39, no. 5, pp. 353–358, Nov. 2018, doi: 10.1016/j.irbm.2018.09.004.

[14] S. Tekale, P. Shingavi, and S. Wandhekar, “Prediction of Chronic Kidney Disease Using Machine Learning Algorithm,” IJARCCE, vol. 7, pp. 92–96, 2018, doi: 10.17148/IJARCCE.2018.71021.

[15] M. Chen, Y. Hao, K. Hwang, L. Wang, and L. Wang, “Disease Prediction by Machine Learning Over Big Data From Healthcare Communities,” IEEE Access, vol. 5, pp. 8869–8879, 2017, doi: 10.1109/ACCESS.2017.2694446.

[16] A. V. Hazarika and Anju, “Extreme Gradient Boosting using Squared Logistics Loss function,” Int. J. Sci. Dev. Res., vol. 2, no. 8, pp. 54–61, 2017.

[17] R. Tarafdar and Y. Han, “Finding Majority for Integer Elements,” J. Comput. Sci. Coll., vol. 33, no. 5, pp. 187–191, 2018.

[18] M. Almasoud and T. E. Ward, “Detection of Chronic Kidney Disease using Machine Learning Algorithms with Least Number of Predictors,” Int. J. Adv. Comput. Sci. Appl., vol. 10, no. 8, pp. 89–96, 2013, doi: 10.14569/IJACSA.2019.0100813.

[19] J. Xiao et al., “Comparison and development of machine learning tools in the prediction of chronic kidney disease progression,” J. Transl. Med., vol. 17, no. 1, pp. 1–13, 2019, doi: 10.1186/s12967-019-1860-0.

[20] Mamidala, J. V., Enokkaren, S. J., Attipalli, A., Bitkuri, V., Kendyala, R., & Kurma, J. (2023). Machine Learning Models Powered by Big Data for Health Insurance Expense Forecasting. International Research Journal of Economics and Management Studies IRJEMS, 2(1).

[21] Nadella, V. M. (2023). Zero Trust Architecture for Telecom Operations. International Journal of Emerging Research in Engineering and Technology, 4(3), 115-129.

[22] Bitkuri, V., Kendyala, R., Kurma, J., Enokkaren, S. J., & Mamidala, J. V. (2023). Forecasting Stock Price Movements With Deep Learning Models for time Series Data Analysis. Journal of Artificial Intelligence & Cloud Computing. SRC/JAICC-531. DOI: doi. org/10.47363/JAICC/2023 (2), 489, 2-9.

[23] Nadella, V. M. (2023). Anomaly Detection and Fault Prediction using ML in Telecom Operations. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 134-143.

[24] Kosaraju, P., & Nadella, V. M. (2022). Security and Privacy in IoT Ecosystems. Universal Library of Engineering Technology, (Issue).

[25] Singh, A. A. S. S., Mania, V., Kothamaram, R. R., Rajendran, D., Namburi, V. D. N., & Tamilmani, V. (2023). Exploration of Java-Based Big Data Frameworks: Architecture, Challenges, and Opportunities. Journal of Artificial Intelligence & Cloud Computing, 2(4), 1-8.

[26] Routhu, K. K. (2023). AI-driven succession planning in Oracle HCM Cloud: Building resilient leadership pipelines through predictive analytics. International Journal of Science, Engineering and Technology, 11(5).

[27] Tamilmani, V., Namburi, V. D., Singh Singh, A. A., Maniar, V., Kothamaram, R. R., & Rajendran, D. (2023). Real-Time Identification of Phishing Websites Using Advanced Machine Learning Methods. Available at SSRN 5837142.

[28] Routhu, K. K. (2023). AI-driven succession planning in Oracle HCM Cloud: Building resilient leadership pipelines through predictive analytics. International Journal of Science, Engineering and Technology, 11(5). https://doi.org/10.5281/zenodo.17292018

[29] From Fragmentation to Focus: The Benefits of Centralizing Procurement. (2023). International Journal of Research and Applied Innovations, 6(6), 9820-9833. https://doi.org/10.15662/

[30] Routhu, K. K. (2023). Embedding fairness into the digital enterprise, data driven DEI strategies with Oracle HCM Analytics. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 9(8), 266-274.

[31] Routhu, K. K. (2023). AI-driven skills forecasting in Oracle HCM Cloud: From static competencies to predictive workforce design. International Journal of Science, Engineering and Technology, 11(1).

[32] Padur, S. K. R. (2023). AI-Augmented Enterprise ERP Modernization: Zero-Downtime Strategies for Oracle E-Business Suite R12. 2 and Beyond. Available at SSRN 5605510.

[33] Routhu, K. K. (2022). From Case Management to Conversational HR: Redefining Help Desks with Oracle’s AI and NLP Framework. International Journal of Science, Engineering and Technology, 10(6).

[34] Vattikonda, N., Gupta, A. K., Polu, A. R., Narra, B., Buddula, D. V. K. R., & Patchipulusu, H. H. S. (2022). Blockchain Technology in Supply Chain and Logistics: A Comprehensive Review of Applications, Challenges, and Innovations. International Journal of Emerging Trends in Computer Science and Information Technology, 3(3), 72-80.

[35] Attipalli, A., BITKURI, V., Mamidala, J. V., Kendyala, R., & KURMA, J. (2022). Empowering Cloud Security with Artificial Intelligence: Detecting Threats Using Advanced Machine learning Technologies. Available at SSRN 5741263.

[36] Padur, S. K. R. (2022). Intelligent resource management: AI methods for predictive workload forecasting in cloud data centers. J. Artif. Intell. Mach. Learn. & Data Sci, 1(1), 2936-2941.

[37] Nadella, V. M. (2022). Digital Twins for Predictive Network Management and System Simulation. International Journal of AI, BigData, Computational and Management Studies, 3(3), 100-111.

[38] Routhu, K. K. (2022). From RFID to Geofencing: IoT-Enabled Smart Time Tracking in Oracle HCM Cloud. International Journal of Science, Engineering and Technology, 10(4).

[39] Nadella, V. (2019). Extracting road traffic data through video analysis using automatic camera calibration and deep neural networks.

[40] Polam, R. M., Kamarthapu, B., Kakani, A. B., Nandiraju, S. K. K., Chundru, S. K., & Vangala, S. R. (2022). Data Security in Cloud Computing: Encryption, Zero Trust, and Homomorphic Encryption. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 31-41.

[41] Padur, S. K. R. (2022). AI augmented platform engineering, transforming developer experience through intelligent automation and self optimizing internal platforms. International Journal of Science, Engineering and Technology, 10(5), 10-5281.

[42] Kosaraju, P. , & Nadella, V. M. (2021). Quality of Experience (QoE) and Network Performance Modelling for Multimedia Traffic. Journal of Artificial Intelligence and Big Data, 1(1), 1-13. https://doi.org/10.31586/jaibd.2021.1358.

Downloads

Published

2024-06-30

Issue

Section

Articles

How to Cite

1.
Nagumotu VT, Reddy Kavuluri HV, Pathani AK, Dasari A, Chilakapati VK, Keshireddy SR. Early Diseases Diagnosis of Chropnic via Machine Learning Based Models in Big Data Health Records. IJERET [Internet]. 2024 Jun. 30 [cited 2026 Apr. 15];5(2):136-4. Available from: https://ijeret.org/index.php/ijeret/article/view/529