Şirket İflaslarının Tahmin Edilmesinde Karar Ağacı Algoritmalarının Karşılaştırmalı Başarım Analizi

-- Bu çalışmada, önemli bir ekonomik problem olan, şirket iflaslarının tahmin edilmesi ele alınmıştır. Bunun için iki yüz kırk farklı şirkete ilişkin finansal özellikleri içeren bir veri seti kullanılmıştır. Ele alınan veri seti, sınıflandırma ve tahmin etmede kullanılan önemli yöntemlerden biri olan karar ağacı yöntemine ilişkin yedi farklı algoritma uygulanarak, doğru sınıflandırma yüzdesi, ortalama mutlak hata, ortalama karesel hatanın karekökü, kesinlik, geri çağırma, F-ölçütü gibi ölçütler bakımından değerlendirilmiştir. Deneysel sonuçlar incelendiğinde, karar ağacı algoritmalarının şirket iflaslarının tahmin edilmesi için uygun bir yöntem olduğu ve kısmen başarılı doğru sınıflandırma yüzdesi elde ettiği gözlemlenmiştir

Comparative Performance Analysis of Decision Tree Algorithms in the Corporate Bankruptcy Prediction

-- In this study, corporate bankruptcy prediction, a crucial economic problem is tackled. To do this, a data set of 240 distinct companies with financial features is used. This data set is applied to one of the most important classification and forecasting methods, i.e. decision tree method. Seven different decision tree algorithms are evaluated in terms of accuracy percentage, mean absolute error, root mean squared error, precision, recall, F-measure. According to experimental results, decision tree algorithms are appropriate methods for corporate bankruptcy prediction with relatively successful accuracy rates

___

  • [1] S. Sumathi, S.N. Sivanandam, Introduction to Data Mining and Its Applications, Springer-Verlag, Berlin, 2006.
  • [2] F. Gorunescu, Data Mining: Concepts, Models and Techniques, Springer-Verlag, Berlin, 2011.
  • [3] J. R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kauffman, San Francisco, CA, USA, 1993.
  • [4] I. Wayne, P. Langley, "Induction of One-Level Decision Trees", Proceedings of the Ningth International Conference on Machine Learning, 233- 240, 1992.
  • [5] P. Domingos, P., G. Hulten, "Mining High-speed Data Streams", Proceedings of the sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 71-80, 2000.
  • [6] N. Landwehr, M. Hall, E. Frank, "Logistic Model Trees", Machine Learning, 59(1-2), 161-205, 2005.
  • [7] L. Breiman, "Random Forests", Machine Learning, 45(1), 5-32, 2001.
  • [8] W. Fan, H. Wang, P. S. Yu, S. Ma, "Is Random Model Better? On Its Accuracy and Efficiency", The Third IEEE International Conference on Data Mining, 51-58, 2003.
  • [9] Y. Zhao, Y. Zhang, "Comparison of Decision Tree Methods for Finding Active Objects", Advances in Space Research, 41(12), 1955-1959, 2008.
  • [10] D. L. Olson, D. Delen, Y. Meng, "Comparative Analysis of Data Mining Methods for Bankruptcy Prediction", Decision Support Systems, 52(2), 464-473, 2012.
  • [11] M. Lam, "Neural Network Techniques For Financial Performance Prediction: Integrating Fundamental And Technical Analysis", Decision Support Systems, 37(4), 567-581, 2004.
  • [12] S. Aoki, Y. Hosonuma, "Bankruptcy Prediction Using Decision Tree", Proceedings of the Second Nikkei Econophysics Symposium, 299-302, 2004.
  • [13] M. F. Santos, P. Cortez, J. Pereira, H. Quintela, "Corporate bankruptcy prediction using data mining techniques", WIT Transactions on Information and Communication Technologies, 37, 349-357, 2006.
  • [14] J. Neves, A. Vieria, "Improving Bankruptcy Prediction With Hidden Layer Learning: Vector Quantization", The European Accounting Review, 15(2), 253-271, 2006.
  • [15] E. Alfaro, N. Garcia, M. Gamez, D. Elizondo, "Bankruptcy Forecasting: An Empirical Comparision o
  • [16] A. Nachev, "Fuzzy ARTMAP Neural Network for Classifying the Financial Health of a Firm", Lecture Notes in Computer Science, 5027(2008), 82-91, 2008.
  • [17] S. Cho, H. Hong, B-C. Ha, "A Hybrid Approach Based On The Combination of Variable Selection Using Decision Trees And Case-Based Reasoning Using the Mahalanobis Distance: For Bankruptcy Prediction", Expert Systems with Applications, 37(4), 3482-3488, 2010.
  • [18] H. R. Doolatabadi, S. M. Hoseini, R. Tahmasebi, "Using Decision Tree Model and Logistic Regression to Predict Companies Financial Bankruptcy in Tehran Stock Exchanges", International Journal of Emerging Research in Management &Technology, 2(9), 7-16, 2013.
  • [19] E. Zibanezhad, B. Mobarake, D. Foroghi, "Applying decision tree to predict bankruptcy", IEEE International Conference on Computer Science and Automation Engineering (CSAE), 165-169, 2011.
  • [20] Internet: V. Mohan, Decision Trees: A comparison of various algorithms for building decision trees, http://cs.jhu.edu/~vmohan3/document/ai_dt.pdf, 08.12.2014.
  • [21] O., Maimon, L. Rokach, "Classification Trees". Data Mining and Knowledge Discovery Handbook, Editör: O., Maimon, L. Rokach, Springer, New York, A.B.D., 149-175, 2010.
  • [22] J. Gehrke, "Decision Trees", The Handbook of Data Mining, Editör: Nong Ye, Lawrence Erlbaum Associates Publishers, London, 149-175, 2003.
  • [23] P.J. García-Laencina, J.L. Sancho-Gómez, A.R. Figueiras-Vidal, "Pattern Classification with Missing Data: A Review", Neural Comput. Appl., 19(2), 263-282, 2010.
  • [24] D. Birant, "Comparison of Decision Tree Algorithms for Predicting Potential Air Pollutant Emissions with Data Mining Models", Journal of Environmental Informatics, 17(1), 46-53, 2011.
  • [25] R. Kothari, M. Dong, "Decision Trees for Classification: A Review and Some New Results", Pattern Recognition: From Classical to Modern Approaches, Editör: S. K. Pal, A. Pal, World Scientific, New Jersey, 2001.
  • [26] S. B. Kotsiantis, "Decision Trees: A Recent Overview", Artificial Intelligence Review, 39(4), 261-283, 2013.
  • [27] J. Quinlan, "Induction of Decision Trees", Machine Learning, 1(1), 81-106, 1986.
  • [28] X. Niuniu, L. Yuxun, "Review of Decision Trees", The Third IEEE International Conference on Computer Science and Information Technology, 105- 109, 2010.
  • [29] I. H. Witten, E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2. Baskı, San Elsevier, Francisco, 2005.
  • [30] P. Doetsch, C. Buck, P. Golik, N. Hoppe, M. Kramp, J. Laudenberg, C. Oberdörfer, P. Steingrube, J. Forster, A. Mauser, "Logistic Model Trees with AUCsplit Criterion for the KDD Cup 2009 Small Challenge", JMLR: Workshop and Conference Proceedings, 77-88, 2009.
  • [31] W. Pietruszkiewicz, "Dynamical Systems and Nonlinear Kalman Filtering Applied in Classification", Proceedings of 2008 7th IEEE International Conference on Cybernetic Intelligent Systems, 263- 268, 2008.
  • [32] Internet: Mean Absolute Error, http://en.wikipedia.org/wiki/Mean_absolute_error, 08.12.2014.
  • [33] K. Essig, H. Ritter, O. Strogan, T. Schack, "Influence of Movement Expertise on Visual Perception of Objects, Events and Motor Action: A Modeling Approach", Developing and Applying Biologically-Inspired Vision Systems, Editor: Pomplun, M., Suzuki, J., IGI Global, Hershey, A.B.D., 1-30, 2013.
  • [34] Internet: Precision and Recall, http://en.wikipedia.org/wiki/Precision_and_recal, 08.12.2014.