Fault Detection in Manufacturing Companies with Ensemble Machine Learning Method

Fault Detection in Manufacturing Companies with Ensemble Machine Learning Method

Purpose – Multivariate control charts cannot be indicative of which variable is the cause of the out-ofcontrol signal. To keep the process under control, the cause of the out-of-control signal must be determined correctly. The study, it is aimed to predict the variable that causes the out-of-control with the highest accuracy when there is 2 sigma and 3 sigma shift from the mean. Design/methodology/approach – The method used in the study is machine learning-based detection analysis. The data set was taken from a company that produces furniture connecting part. Sample values were collected from the enterprise. Then the under-control samples were detected from these. According to these samples' mean and standard deviation values, data was produced in such a way that 2 sigma and 3 sigma shifts occur from the mean for training the machine learning algorithms. To predict the out-of-control samples three individual machine learning algorithms and three ensemble methods (Bagging, Boosting and Stacking) were used. In addition, 3 stacking models were developed using combinations of the individual algorithms. Findings – When the results are examined, higher accuracy has been reached by using a model developed with the stacking method than individual algorithms. The highest accuracy rates have been achieved as 69.00% for 2 sigma and 85.75% for 3 sigma shift with the stacking 3 models developed based on the stacking method. Discussion – The issue of detecting out-of-control signals in the quality processes in manufacturing companies with the least error was examined and the results were discussed.

___

  • Alfaro, E., Alfaro, J. L., Gamez, M., & Garcia, N. (2009). A boosting approach for understanding out-of-control signals in multivariate control charts. International Journal of Production Research, 47(24), 6821-6834.
  • Ali, M., Eyduran, E., Tariq, M. M., Tirink, C., Abbas, F., Bajwa, M. A., & Jan, S. (2015). Comparison of artificial neural network and decision tree algorithms used for predicting live weight at post weaning period from some biometrical characteristics in Harnai sheep. Pakistan Journal of Zoology, 47(6).
  • Alpaydın, E. (2012), Yapay Oğrenme, Boğaziçi University, 3. Editon.
  • Anwar, H., Qamar, U., & Muzaffar Qureshi, A. W. (2014). Global optimization ensemble model for classification methods. The Scientific World Journal, 2014.
  • Aparisi, F., & Sanz, J. (2010). Interpreting the out-of-control signals of multivariate control charts employing neural networks. International Journal of Computer and Information Engineering, 4(1), 24-28.
  • Arıoglu Salmona, M. Ö., & Fırat, S. (2005). Bir Endüstri İşletmesinde Çok Değişkenli İstatistiksel Süreç Kontrolü. V. Ulusal Üretim Araştırmaları Sempozyumu, İstanbul Ticaret Üniversitesi.
  • Asadi, A., & Farjami, Y. (2019). Online Mean Shift Detection in Multivariate Quality Control Using Boosted Decision Tree learning. Journal of System Management, 5(2), 81-106.
  • Bakdi, A., Kouadri, A., & Bensmail, A. (2017). Fault detection and diagnosis in a cement rotary kiln using PCA with EWMA-based adaptive threshold monitoring scheme. Control Engineering Practice, 66, 64-75.
  • Bhatia, N. (2010). Survey of nearest neighbor techniques. arXiv preprint arXiv:1007.0085.
  • Breiman, L. (1996). Bagging predictors. Machine learning, 24(2), 123-140.
  • Chen, L. H., & Wang, T. Y. (2004). Artificial neural networks to classify mean shifts from multivariate χ2 chart signals. Computers & Industrial Engineering, 47(2-3), 195-205.
  • Cheng, C. S., & Cheng, H. P. (2008). Identifying the source of variance shifts in the multivariate process using neural networks and support vector machines. Expert Systems with Applications, 35(1-2), 198-206.
  • Cheng, C. S., & Lee, H. T. (2012). Identifying the out-of-control variables of multivariate control chart using ensemble SVM classifiers. Journal of the Chinese Institute of Industrial Engineers, 29(5), 314-323.
  • Demir, N., & Dalkiliç, G. (2018). Modi ed stacking ensemble approach to detect network intrusion. Turkish Journal of Electrical Engineering & Computer Sciences, 26(1), 418-433.
  • Dreiseitl, S., Ohno-Machado, L., Kittler, H., Vinterbo, S., Billhardt, H., & Binder, M. (2001). A comparison of machine learning methods for the diagnosis of pigmented skin lesions. Journal of biomedical informatics, 34(1), 28-36.
  • Du, S., Lv, J., & Xi, L. (2012). On-line classifying process mean shifts in multivariate control charts based on multiclass support vector machines. International Journal of Production Research, 50(22), 6288-6310.
  • Edward-Jackson, J. (1985). Multivariate quality control. Communications in Statistics-Theory and Methods, 14(11), 2657-2688.
  • Freund, Y., Schapire, R., & Abe, N. (1999). A short introduction to boosting. Journal-Japanese Society For Artificial Intelligence, 14(771-780), 1612.
  • Gonzalez-de la Parra, M., & Rodriguez-Loaiza, P. (2003). Application of the multivariate T2 control chart and the Mason–Tracy–Young decomposition procedure to the study of the consistency of impurity profiles of drug substances. Quality Engineering, 16(1), 127-142.
  • Gowda, S., Kumar, H., & Imran, M. (2018). Ensemble based learning with stacking, boosting and bagging for unimodal biometric identification system, 12th International Conference on Recent Trends in Engineering and Technology, 30-36.
  • Guh, R. S., & Shiue, Y. R. (2008). An effective application of decision tree learning for on-line detection of mean shifts in multivariate control charts. Computers & Industrial Engineering, 55(2), 475-493.
  • Han, J., Pei, J., & Kamber, M. (2011). Data mining: concepts and techniques. Elsevier.
  • He, S., Wang, G. A., Zhang, M., & Cook, D. F. (2013). Multivariate process monitoring and fault identification using multiple decision tree classifiers. International Journal of Production Research, 51(11), 3355-3371.
  • Hotelling, H. (1947). Multivariate quality control, illustrated by the air testing of sample bombsights. Techniques of statistical analysis, 111-184.
  • Hu, L. Y., Huang, M. W., Ke, S. W., & Tsai, C. F. (2016). The distance function effect on k-nearest neighbor classification for medical datasets. SpringerPlus, 5(1), 1-9.
  • Huda, S., Abdollahian, M., Mammadov, M., Yearwood, J., Ahmed, S., & Sultan, I. (2014). A hybrid wrapper– filter approach to detect the source (s) of out-of-control signals in multivariate manufacturing process. European Journal of Operational Research, 237(3), 857-870.
  • Jiang, J., & Song, H. M. (2017). Diagnosis of out-of-control signals in multivariate statistical process control based on bagging and decision tree. Asian Business Research, 2(2), 1.
  • Jonathan, O., Omoregbe, N., & Misra, S. (2019). Empirical Comparison of Cross-Validation and Test Data on Internet Traffic Classification Methods. In Journal of Physics: Conference Series, 1299(1), p. 012044. IOP Publishing.
  • Karimi, S., Yin, J., & Baum, J. (2015). Evaluation methods for statistically dependent text. Computational Linguistics, 41(3), 539-548.
  • Lantz, B. (2013). Machine learning with R : learn how to use R to apply powerful machine learning methods and gain an insight into real-world applications. Birmingham: Packt Publishing Ltd.,66-343.
  • Lauría, E. J., Presutti, E., Kapogiannis, M., & Kamath, A. (2018). Stacking Classifiers for Early Detection of Students at Risk. In CSEDU (1) (pp. 390-397).
  • Li, T. F., Hu, S., Wei, Z. Y., & Liao, Z. Q. (2013). A framework for diagnosing the out-of-control signals in multivariate process using optimized support vector machines. Mathematical Problems in Engineering, 2013.
  • Ma, Z., Wang, P., Gao, Z., Wang, R., & Khalighi, K. (2018). Ensemble of machine learning algorithms using the stacked generalization approach to estimate the warfarin dose. PloS one, 13(10), e0205872.
  • Mason, R. L., & Young, J. C. (2002). Multivariate statistical process control with industrial applications. Society for Industrial and Applied Mathematics.
  • Mason, R. L., Tracy, N. D., & Young, J. C. (1995). Decomposition of T2 for multivariate control chart interpretation. Journal of quality technology, 27(2), 99-108.
  • Mitchell, T. (1997). Machine learning. McGraw Hill.
  • Montgomery, D. C. (2009). Introduction to statistical quality control. John Wiley & Sons.
  • Murphy, B. J. (1987). Selecting out of control variables with the T2 multivariate quality control procedure. Journal of the Royal Statistical Society: Series D, The Statistician, 36(5), 571-581.
  • Niaki, S. T. A., & Abbasi, B. (2005). Fault diagnosis in multivariate control charts using artificial neural networks. Quality and reliability engineering international, 21(8), 825-840.
  • Onan, A. (2018). Particle Swarm Optimization Based Stacking Method with an Application to Text Classification. Academic Platform Journal of Engineering and Science, 6(2), 134-141.
  • Oza, N. C., & Tumer, K. (2008). Classifier ensembles: Select real-world applications. Information fusion, 9(1), 4- 20.
  • Ramezan C.A., Warner T.A., & Maxwell A.E. (2019). Evaluation of sampling and cross-validation tuning strategies for regional-scale machine learning classification. Remote Sensing, 11(2), 185.
  • Raschka, S. (2018). Model evaluation, model selection, and algorithm selection in machine learning. arXiv preprint arXiv:1811.12808.
  • Refaeilzadeh, P., Tang, L., & Liu, H. (2009), C Cross-Validation, Springer, Boston.
  • Rokach, L. (2010). Ensemble-based classifiers. Artificial intelligence review, 33(1), 1-39.
  • Salvador–Meneses, J., Ruiz–Chavez, Z., & Garcia–Rodriguez, J. (2019). Compressed kNN: K-nearest neighbors with data compression. Entropy, 21(3), 234.
  • Shao, Y. E., & Lin, S. C. (2019). Using a time delay neural network approach to diagnose the out-of-control signals for a multivariate normal process with variance shifts. Mathematics, 7(10), 959.
  • Song, H., Xu, Q., Yang, H., & Fang, J. (2017). Interpreting out-of-control signals using instance-based Bayesian classifier in multivariate statistical process control. Communications in Statistics-Simulation and Computation, 46(1), 53-77.
  • Wolpert, D. H. (1992). Stacked generalization. Neural networks, 5(2), 241-259.
  • Yang, W. A. (2015). Monitoring and diagnosing of mean shifts in multivariate manufacturing processes using two-level selective ensemble of learning vector quantization neural networks. Journal of Intelligent Manufacturing, 26(4), 769-783.
  • Yu, J., Xi, L., & Zhou, X. (2009). Identifying source (s) of out-of-control signals in multivariate manufacturing processes using selective neural network ensemble. Engineering Applications of Artificial Intelligence, 22(1), 141-152.
  • Yu, Y., & Feng, Y. (2014). Modified cross-validation for penalized high-dimensional linear regression models. Journal of Computational and Graphical Statistics, 23(4), 1009-1027.
  • Zhang, C., & Ma, Y. (2012). Ensemble machine learning: methods and applications. Springer Science & Business Media.
  • Zhang, M., & Cheng, W. (2015). Recognition of mixture control chart pattern using multiclass support vector machine and genetic algorithm based on statistical and shape features. Mathematical Problems in Engineering, 2015.
  • Zhang, Y., Li, M., Han, S., Ren, Q., & Shi, J. (2019). Intelligent identification for rock-mineral microscopic images using ensemble machine learning algorithms. Sensors. 19(18), 3914.
  • Zhao, C., Wang, C., Hua, L., Liu, X., Zhang, Y., & Hu, H. (2017). Recognition of control chart pattern using improved supervised locally linear embedding and support vector machine. Procedia engineering, 174, 281-288.
  • Zheng, A. (2015). Evaluating machine learning models: a beginner's guide to key concepts and pitfalls. O'Reilly Media.
  • Zhou, Z. H. (2012). Ensemble methods: foundations and algorithms. Chapman and Hall/CRC
İşletme Araştırmaları Dergisi-Cover
  • ISSN: 1309-0712
  • Yayın Aralığı: 4
  • Başlangıç: 2009
  • Yayıncı: Melih Topaloğlu
Sayıdaki Diğer Makaleler

Roles of the Jumps in Foreign Exchange Markets: A comparative Analysis of Non-Parametric Jump Tests

Ümit EROL, Serkan YEŞİLYURT

Covid-19 Pandemisinin BIST Sektör Endeksleri İle İlişkisi: Bayer – Hanck (2013) Eşbütünleşme Analizi

Ayşegül ERTUĞRUL AYRANCI, Gizem ARI

The Mediating Role of Green Practices in the Relationship between Effective Leadership Practices and Business Competitiveness

Ahmad Fathulla AHMAD, Georgiana KARADAS, Halgurd Fathulla AHMED

Corporate Social Responsibilities (CSR) and Empirical Insight into CSR Practises of Companies Listed on the BIST 100

Melih SÖNMEZ

Analysis of E-Complaint Management in Glamping Businesses within the Scope of a Case Study

F. Kübra AYLAN, Güney ÇETİN GÜRKAN

The Impact of Corporate Strategies on Financial Leverage: Evidence From TurkishListed Firms

Burcu DİNÇERGÖK, Ahmed Ibrahim Abdelhadi HOUR

Bitcoin İçin Volatilite Tahmini: Simetrik ve Asimetrik Garch Modelleri İçin Ampirik Bir Uygulama

Gülfen TUNA, Ahmet Bülent ATASOY

A Study About The Role of Cronyism Perceptions of Tourism Employees upon Their Career Plateu

Ebru ULUCAN, Nihan YAVUZ AKSAKAL

İşletmelerde COSO İç Kontrol Modeli ve Muhasebe Bilgi Sistemine Yönelik Uygulamaların Değerlendirilmesi Üzerine Bir Araştırma

Mehmet DEMİR, Hatice ÇİÇEKAY

Tüketicilerin Fiziksel Mağaza Alışverişlerinde Artırılmış Gerçeklik Uygulaması Olarak Akıllı Ayna Kullanmasının Davranışsal Niyete Etkisi

İbrahim AYDIN, Zübeyir ÇELİK