Predicting Liver Disease Using Decision Tree Ensemble Methods

Damage to the liver, which performs vital functions in the human body, can have fatal consequences, so early diagnosis of liver disease is important. This study attempts to diagnose liver disease with ensemble learning methods, using several clinical values obtained from liver patients and healthy blood donors. Random Forest (RF), J48, AdaBoost, Gradient Boosting Classifier (GBC), and Light Gradient Boosting Machine (Light GBM) algorithms, drawn from bagging and boosting models, were used. The best classification result was obtained with the Light GBM algorithm: 98.8% accuracy, 98.1% precision, 99.4% recall, and a kappa statistic of 0.98, evaluated with 10-fold cross-validation.
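The four reported measures can all be derived from a binary confusion matrix. The sketch below is only an illustration of those standard definitions, not the study's actual pipeline: the class name `ConfusionMatrix`, the helper `pool_folds`, and the per-fold counts are hypothetical, and summing counts across folds is just one common convention for reporting k-fold results. It assumes non-degenerate counts (no zero denominators).

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class ConfusionMatrix:
    """Binary confusion-matrix counts (positive class = liver disease)."""
    tp: int  # diseased, predicted diseased
    fp: int  # healthy, predicted diseased
    fn: int  # diseased, predicted healthy
    tn: int  # healthy, predicted healthy

    @property
    def total(self) -> int:
        return self.tp + self.fp + self.fn + self.tn

    def accuracy(self) -> float:
        return (self.tp + self.tn) / self.total

    def precision(self) -> float:
        return self.tp / (self.tp + self.fp)

    def recall(self) -> float:
        return self.tp / (self.tp + self.fn)

    def kappa(self) -> float:
        """Cohen's kappa: agreement beyond what chance alone would give."""
        n = self.total
        p_observed = self.accuracy()
        # Chance agreement computed from the row and column marginals.
        p_both_pos = ((self.tp + self.fp) / n) * ((self.tp + self.fn) / n)
        p_both_neg = ((self.fn + self.tn) / n) * ((self.fp + self.tn) / n)
        p_expected = p_both_pos + p_both_neg
        return (p_observed - p_expected) / (1 - p_expected)


def pool_folds(folds: Iterable[ConfusionMatrix]) -> ConfusionMatrix:
    """Sum the counts of each cross-validation fold into one matrix."""
    tp = fp = fn = tn = 0
    for fold in folds:
        tp += fold.tp
        fp += fold.fp
        fn += fold.fn
        tn += fold.tn
    return ConfusionMatrix(tp, fp, fn, tn)


# Hypothetical counts for two folds (NOT data from the study).
folds = [
    ConfusionMatrix(tp=23, fp=2, fn=3, tn=22),
    ConfusionMatrix(tp=22, fp=3, fn=2, tn=23),
]
pooled = pool_folds(folds)
print(f"accuracy={pooled.accuracy():.3f} precision={pooled.precision():.3f} "
      f"recall={pooled.recall():.3f} kappa={pooled.kappa():.3f}")
```

Because kappa ranges from -1 to 1 rather than 0 to 100, it is reported as a plain coefficient and not as a percentage.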

Erciyes Üniversitesi Fen Bilimleri Enstitüsü Fen Bilimleri Dergisi
  • ISSN: 1012-2354
  • Publication frequency: 3 issues per year
  • Founded: 1985
  • Publisher: Erciyes Üniversitesi

Fırat ORHANBULUCU, İrem ACER, Fatma LATİFOĞLU, Semra İÇER
