Breast Cancer Diagnosis with Machine Learning Techniques

Breast Cancer Diagnosis with Machine Learning Techniques

Cancer deaths are one of the highest rates of death. Although breast cancer is commonly associated with women, it is sometimes seen in men, and the mortality rate for men with breast cancer may be higher. The importance of early detection and treatment of breast cancer cannot be overstated. Cancer is diagnosed at an early stage thanks to expert systems, artificial intelligence, and machine learning approaches, and data analysis makes life easier for healthcare professionals. The nearest neighbor method, principal component analysis, neighborhood component method approaches were employed to detect breast cancer in this study."Breast Cancer Wisconsin Diagnostic" database was used to create and test the approach. According to the results obtained, the highest success rate with 99.42% was obtained by using neighborhood component analysis and nearest neighbor classification algorithm method.

___

  • [1] P. Gummadi, “Analysis of machine learning in education sector”, 1-5, 2020.
  • [2] A. Jagwani, “A review of machine learning in education”, Journal of Emerging Technologies and Innovative Research (JETIR), 6(5), 384-386, 2019.
  • [3] G. Buyrukoğlu, S. Buyrukoğlu, Z. Topalcengiz, “Comparing regression models with count data to artificial neural network and ensemble models for prediction of generic escherichia coli population in agricultural ponds based on weather station measurements”, Microbal Risk Analysis, 19, 100171, 2021.
  • [4] S. Buyrukoğlu, Y. Yılmaz, “An approach for airfare prices analysis with penalized regression methods”, Data Science, 4(2), 57-61, 2021.
  • [5] S. Buyrukoğlu, “Early detection of Alzheimer’s disease using data mining: comparison of ensemble feature selection approaches”, Konya Journal of Engineering Sciences, 9(1), 50-61, 2021.
  • [6] S. Buyrukoğlu, “Improvement of machine learning models’ performances based on ensemble learning for the detection of Alzheimer Disease”, 6th International Conference on Computer Science and Engineering, 102-106, 2021.
  • [7] World Health Organzation, “International agency for research on cancer-iarc”, 2020, https://gco.iarc.fr/today/home.
  • [8] L. Çelik, “Meme kanseri taramasinda yapay zeka”, 2020, https://www.drozdogan.com/turkiye-kanser- istatistikleri-2020/
  • [9] C. Eyupoglu, “Breast cancer classification using k-nearest neighbors’ algorithm”, The Online Journal of Science and Technology, 8(3), 29-34, 2018.
  • [10] Ł. Jeleń, A. Krzyżak, T. Fevens, M. Jeleń, “Influence of feature set reduction on breast cancer malignancy classification of fine needle aspiration biopsies”, Computers in Biology and Medicine, 79, 80- 91, 2016.
  • [11] S. Savaş, “Detecting the stages of Alzheimer’s disease with pre-trained deep learning architectures”, Arabian Journal for Science and Engineering, 47, 2201-2218, 2022.
  • [12] S. Savaş, N. Topaloğlu, Ö. Kazcı, P. Koşar, “Comparison of deep learning models in carotid artery intima-media thickness ultrasound images: CAIMTUSNet”, Journal of Information Technologies, 15(1), 1-12, 2022.
  • [13] P. Gupta, S. Garg, “Breast cancer prediction using varying parameters of machine learning models”, Procedia Computer Science, vol. 171, pp. 593–601, 2020.
  • [14] V. Chaurasia, S. Pal, B. Tiwari, “Prediction of benign and malignant breast cancer using data mining techniques”, Journal of Algorithms & Computational Technology, vol. 12, no. 2, pp. 119–126, 2018.
  • [15] M.H. Tafish, A.M. El-Halees, “Breast cancer severity degree predication using data mining techniques in the gaza strip”, in 2018 International conference on promising electronic technologies (ICPET), Deir El-Balah, pp. 124–128, 2018.
  • [16] V.N. Gopal, F.A. Turjman, L. Anand, M. Rajesh, “Feature selection and classification in breast cancer prediction using IoT and machine learning”, Measurement, 178, 109442, 2021.
  • [17] B. Sawssen, T. Okba, “A novel machine learning approach for breast cancer diagnosis”, Measurement, 187, 110233, 2022.
  • [18] F. Ismaili, L. Shabani, B. Raufi, J. Adjari, X. Zenuni, “Enhancing breast cancer detection using data mining classification techniques”, Press Academia Procedia, 2nd World Conference on Technology, Innovation and Enterpreunership, 2017, İstanbul, Turkey, 2017.
  • [19] İ. Ateş, T.T. Bilgin, “The investigation of the success of different machine learning methods in breast cancer diagnosis”, Konuralp Medical Journal, 13(2), 347-356, 2021.
  • [20] O. Sevli, “Performance Comparison of Different Machine Learning Techniques in Diagnosis of Breast Cancer”, European Journal of Science and Technology, 16, 176-185, 2019.
  • [21] A. Rahman, R.C. Muniyandi, D. Albashish, M. Rahman, O.L. Usman, ”Artificial neural network with taguchi method for robust classification model to improve classification accuracy of breast cancer”, PeerJ Computer Science, 2-27, 2020.
  • [22] M.F. Akay, “Support vector machines combined with feature selection for breast cancer diagnosis”, Expert Systems with Applications, 36, 3240-3247, 2009.
  • [23] S. Ibrahim, S. Nazir, S.A. Velastin, “Feature selection using correlation analysis and principal component analysis for accurate breast cancer diagnosis”, Journal of Imaging, 7(225), 3-16, 2021.
  • [24] M.H. Memon, J.P. Li, A.U. Haq, M.H. Memon, W. Zhou, “Breast cancer detection in the IOT health environment using modified recursive feature selection”, Hindawi Wireless Communication and Mobile Computing, 1-19, 2019.
  • [25] E.Y. Bayrak, P. Kırcı, T. Ensari, “Comparison of machine learning methods for breast cancer diagnosis”, 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), 2019.
  • [26] P.P. Kumar, M.A.V. Bai, G.G. Nair, “An efficient classification framework for breast cancer using hyperparameter tuned random decision forest classifier and Bayesian optimization”, Biomedical Signal Processing and Control, 68, 102682, 2021.
  • [27] D. Dua, C. Graff, “UCI machine learning repository”, Irvine, CA: University of California, School of Information and Computer Science, 2019 http://archive.ics.uci.edu/ml
  • [28] A. Zang, A. Casari, “Feature engineering for machine learning: principles and techniques for data scientist”, O’Relly Media Publishing, Sebastopol, USA, 2018.
  • [29] F. Salo, A.B. Nassif, A. Essex,” Dimensionality reduction with IG-PCA and ensemble classifier for network intrusion detection”, Computer Networks, 148, 164-175, 2019.
  • [30] H.J. Chiu, T.H.S. Li, P.H. Kuo, “Breast cancer–detection system using PCA, multilayer perceptron, transfer learning, and support vector machine”, IEEE Access, 8, 204309-204324, 2020.
  • [31] S. Laghmati, B. Cherradi, A. Tmiri, O. Daanouni, S. Hamida, “Classification of patients with breast cancer using neighborhood component analysis and supervised machine learning techniques”, In 2020 3rd International conference on advanced communication technologies and networking (CommNet), IEEE pp. 1-6, 2020.
  • [32] S.F. Khorshid, A.M. Abdulazeez, “Breast cancer diagnosis based on k-nearest neighbors: a review”, PalArch's Journal of the archaeology of Egypt/Egyptology, 18(4), 1927-1951, 2021.
  • [33] R. Massafra, A. Latorre, A. Fanizzi, R. Bellotti, V. Didonna, V. Giotta, V. Lorusso, “A clinical decision support system for predicting invasive breast cancer recurrence: preliminary results”, Frontiers in Oncology, 11, 284, 2021.
  • [34] T. Savaş, S. Savaş, “Feature-based performance comparison of machine learning algorithms for phishing detection through uniform resource locator”, Journal of Polytechnic, early access, 2021.
  • [35] T.A. Assegie, “An optimized k-nearest neighbor-based breast cancer detection”, Journal of robotics and control (JRC), 2(3), 115-118, 2021.
  • [36] C.J. Willmott, K. Matsuura, “Advantages of the mean absolute error (MAE) over the root mean square error (RMSE)”, in assessing average model performance. Climate Research, 30, 79-82, 2005.
  • [37] G. Kou, Y. Peng, G. Wang, “Evaluation of clustering algorithms for financial risk analysis using MCDM methods”, Inf Sci (Ny), 275, 1–12, 2014
  • [38] H. Xiao, K. Yi, R. Peng, G. Kou, “Reliability of a Distributed Computing System with Performance Sharing”, IEEE Trans Reliab, 1–12, 2021.
  • [39] G. Kou, H. Xiao, M. Cao, L.H. Lee, “Optimal computing budget allocation for the vector evaluated genetic algorithm in multi-objective simulation optimization”, Automatica, 129, 109599, 2021.
  • [40] H. Xiao, Y. Yan, G. Kou, S. Wu, “Optimal Inspection Policy for a Single-Unit System Considering Two Failure Modes and Production Wait Time”, IEEE Trans Reliab, pp:1–13, 2021.