Comparison of M, MM and LTS estimators in linear regression in the presence of outlier

Comparison of M, MM and LTS estimators in linear regression in the presence of outlier

In this study, it was aimed to evaluate the performance of different estimators that will be used in regression analysis, which is one of the multivariate statistical methods in the presence of outliers in the data set. Sixth month live weight was estimated with various body measurements for Saanen kids taken from a private farm. In the data set, the use and performance of robust estimators were evaluated because the least squares method did not provide reliable results in the case of outliers. M (for Huber and Tukey bisquare) estimator, MM estimator and LTS estimator were used as robust used in the presence of outliers. MSE, RMSE, rRMSE, MAPE, MAD, R2 , R2 adj and AIC were used as model comparison criteria in the study. As a result of the study, in the case of outlier in the data set, Huber type M estimator can be recommended.

___

  • 1. Eyduran E, Zaborski D, Waheed A, Celik S, Karadas K et al. Comparison of the predictive capabilities of several data mining algorithms and multiple linear regression in the prediction of body weight by means of body measurements in the indigenous Beetal goat of Pakistan. Pakistan Journal of Zoology 2017; 49 (1): 257-265.
  • 2. Birteeb PT, Ozoje MO. Prediction of live body weight from linear body measurement of West African long-legged and West African dwarf sheep in Northern Ghana. Online Journal of Animal and Feed Research 2012; 2: 425-434.
  • 3. Eyduran E, Akin M, Eyduran SP. Application of Multivariate Adaptive Regression Splines through R Software. Ankara, Turkey: Nobel Academic Publishing; 2019.
  • 4. Yilmaz F, Bayyurt L, Abaci SH, Tahtali Y. Comparison of Least Squares and Some Bias Estimators in Multicollinearity. Turkish Journal of Agriculture - Food Science and Technology 2020; 8 (3): 793-799.
  • 5. Aytekin I, Eyduran E, Karadas K, Akşahan R, Keskin I. Prediction of fattening final live weight from some body measurements and fattening period in young bulls of crossbred and exotic breeds using MARS data mining algorithm. Pakistan Journal of Zoology 2018; 50 (1): 189-195.
  • 6. Yakubu A. Fixing collinearity instability ın the estimation of body weight from morpho-biometrical traits of West African dwarf goats. Trakia Journal of Sciences 2009; 7 (2): 61-66.
  • 7. Tadesse A, Gebremariam T, Gangwar SK. Application of linear body measurements for predicting body weight of Abergelle goat breed in Tigray Region, Northern Ethiopia. Global Joural of Bio-Science & Biotechnology 2012; 1 (2): 314-319.
  • 8. Dakhlan A, Saputra A, Hamdani MDI, Sulastri S. Regression models and correlation analysis for predicting body weight of female ettawa grade goat using its body measurements. Advances in Animal and Veterinary Sciences 2020; 8 (11): 1142-1146.
  • 9. Gore DLM, Muasya TK. Okeno TO, Mburu JN. Comparative reproductive performance of Saanen and Toggenburg bucks raised under tropical environment. Tropical Animal Health and Production 2020; 52: 2653-2658.
  • 10. Moaeen-ud-Din M, Ahmad N, Iqbal A, Abdullah M. Evaluation of different formulas for weight estimation in Beetal, Teddi and crossbred (Beetal x Teddi) goats. Journal of Animal and Plant Sciences 2006; 16 (3-4): 70-74.
  • 11. Parés PM, Mwaanga ES, Caballero M, Sabate J, Valenzuela S. Live weight estimation of Gwembe goat (Capra hircus) from measurement of thoracic girth. Journal of Veterinary Anatomy 2012; 5 (2): 9-14.
  • 12. Tyasi TL, Mathapo MC, Mokoena K, Maluleke D, Rashijane LT et al. Assessment of relationship between body weight and morphological traits of South African non-descript indigenous goats. Journal of Animal Health and Production 2020; 8 (1): 32-39.
  • 13. Eyduran E, Waheed A, Tariq MM, Iqbal F, Ahmad S. Prediction of live weight from morphological characteristics of commercial goat in Pakistan using factor and principal component scores in multiple linear regression. The Journal of Animal & Plant Sciences 2013; 23 (6): 1532-1540.
  • 14. Ari A, Onder H. Regression models used for different data structures. Anadolu Journal of Agricultural Sciences 2013, 28 (3): 168-174 (in Turkish with an abstract in English).
  • 15. Montgomery DC, Peck EA, Vining GG. Introduction to Linear Regression Analysis 3rd ed. New York, NY,USA: John Wiley & Sons, Inc.; 2001.
  • 16. Cankaya S, Kayaalp GT, Sangun L, Tahtali Y, Akar M. A comparative study of estimation methods for parameters in multiple linear regression model. Journal of Applied Animal Research 2006, 29 (1): 43-47.
  • 17. Gurunlu Alma O,Vupa O. The comparison of Least Squares and Least Median Squares Estimation methods which are used in linear regression analysis. Süleyman Demirel University Faculty of Arts and Science Journal of Science 2008, 3 (2): 219- 229 (in Turkish with an abstract in English).
  • 18. Maronna RA, Martin RD, Yohai VJ, Salibian-Barrera M. 2nd ed. Robust Statistics: Theory and Methods (with R). Oxford, UK: John Wiley & Sons; 2019.
  • 19. Filzmoser P. Kurnaz FS. A robust Liu regression estimator. Communications in Statistics - Simulation and Computation 2018; 47 (2): 432-443.
  • 20. Walfish S. A review of statistical outlier methods. Pharmaceutical Technology 2006, 30 (11): 82-86.
  • 21. Abacı SH. A comparative study of some estimation methods in simple linear regression model for different sample sizes in presence of outliers. MSc, Ondokuz Mayis University, Samsun, Turkey, 2013.
  • 22. Agullo J, Croux C, Van Aelst S. The multivariate least trimmed squares estimator. Journal of Multivariate Analysis 2008, 99 (3): 311-318.
  • 23. Revelle W. psych: Procedures for Personality and Psychological Research, Northwestern University, Evanston, Illinois, USA, 2020.
  • 24. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2020.
  • 25. Maechler M, Rousseeuw P, Croux C, Todorov V, Ruckstuhl A et al. robustbase: Basic Robust Statistics R package version 0.93-6; 2020.
  • 26. Alfons A. robustHD: Robust Methods for High-Dimensional Data. R package version 0.6.1; 2019.
  • 27. Eyduran E. ehaGoF: Calculates Goodness of Fit Statistics. R package version 0.1.0; 2019.
  • 28. Mendes M, Akkartal E. Regression tree analysis for predicting slaughter weight in broilers. Italian Journal of Animal Science 2009, 8: 615-624.
  • 29. Gokce A. Biased regression analysis based on some robust estimators. MSc, Eskişehir Osmangazi University, Eskişehir, Turkey, 2019.
  • 30. Box GE. Non-normality and tests on variances. Biometrika 1953; 40 (3/4): 318-335.
  • 31. Alpar R. Uygulamalı çok değişkenli istatistiksel yöntemler. 4. Baskı. Ankara, Turkey: Detay Yayıncılık; 2013.
  • 32. Cankaya S, Eker S, Abaci SH. Comparison of Least Squares, Ridge Regression and Principal Component approaches in the presence of multicollinearity in regression analysis. Turkish Journal of Agriculture-Food Science and Technology 2019; 7 (8): 1166-1172.
  • 33. Huber PJ. Robust Regression: Asymptotics, Conjectures, and Monte Carlo. Annals of Statistics 1973; 1: 799-821.
  • 34. SAS Institute Inc. 2018. SAS/STAT® 15.1 User’s Guide. Chapter 4. Cary, NC: SAS Institute Inc.
  • 35. Huber PJ. Robust Statistics. New York , USA: John Wiley & Sons; 1981.
  • 36. Alamgir, Ali A., Khan SH, Khan DM, Khalil U. A New Efficient Redescending M- Estimator: Alamgir Redescending M-estimator. Research Journal of Recent Sciences 2013; 2 (8): 79-91.
  • 37. Yohai VJ. High breakdown-point and high efficiency robust estimates for regression. The Annals of Statistics 1987; 15: 642- 656.
  • 38. Susanti Y, Pratiwi H, Sulistijowati HS, Liana T. M estimation, S estimation, and MM estimation in robust regression. International Journal of Pure and Applied Mathematics 2014; 91 (3): 349-360.
  • 39. Arslan O, Billor N. Robust Liu estimator for regression based on an M-estimator. Journal of Applied Statistics 2000; 27 (1): 39-47.
  • 40. Rousseeuw PJ. Least median of squares regression. Journal of American Statistical Association 1984; 79 (388): 871-880.
  • 41. Rousseeuw PJ, Leroy AM. Robust Regression and Outlier Detection. New York, NY, USA: John Wiley & Sons; 1987.
  • 42. Kan B, Alpu O, Yazici B. Robust ridge and robust Liu estimator for regression based on the LTS estimator, Journal of Applied Statistics 2013; 40 (3): 644-655.
  • 43. Tatliyer A. The effects of raising type on performances of some data mining algorithms in lambs. Journal Of Agriculture and Nature 2020; 23 (3): 772-780.
  • 44. Cankaya S. A Comparative study of some estimation methods for parameters and effects of outliers in simple regression model for research on small ruminants. Tropical Animal Health and Production 2009; 41 (1): 35-41.
  • 45. Coskuntuncel O. Eğitimle İlgili Sapan Değer İçeren Veri Kümelerinde En Küçük Kareler ve Robust M Tahmin Edicilerin Karşılaştırılması. Mersin University Journal of the Faculty of Education 2009; 5 (2): 251-262 (in Turkish with an abstract in English).
  • 46. Almetwally EM, Almongy, HM. Comparison between M-estimation, S-estimation, and MM estimation methods of robust estimation with application and simulation. International Journal of Mathematical Archive 2018; 9 (11): 55-63.
Turkish Journal of Veterinary and Animal Sciences-Cover
  • ISSN: 1300-0128
  • Yayın Aralığı: Yılda 6 Sayı
  • Yayıncı: TÜBİTAK
Sayıdaki Diğer Makaleler

Determination of the morphometric characteristics of donkey (Equus asinus) populations reared in Turkey

Fulya ÖZDİL, Selçuk KAPLAN, Serdar GENÇ, Hasan BULUT, Selen YATKIN, Sezen ARAT, Eser Kemal GÜRCAN, Mehmet İhsan SOYSAL, Emel ÖZKAN ÜNAL

The molecular prevalence of Borrelia burgdorferi, Babesia spp., and Anaplasma spp. in shelter dogs of the Thrace Region in Turkey

Nuri ALTUĞ, Fulya ALTINOK YİPEL, Dilek MUZ, Mustafa Necati MUZ

Comparison of M, MM and LTS estimators in linear regression in the presence of outlier

Cem TIRINK, Hasan ÖNDER

A biological perspective on interpreting interaction effect

Serdar GENÇ, Mehmet MENDEŞ

The effects of biological and health characteristics of dogs on intraindividual variability of blood parameters

Marko CINCOVIĆ, Sandra NIKOLIĆ, Branislava BELIĆ, Nikolina NOVAKOV, Nada PLAVŠA, Sara SAVIĆ

Do horses learn how to reach for feed depending on the time of day?

Agnieszka ZIEMIAŃSKA, Maciej TROJAN, Konrad BAZEWICZ, Justyna WOJTAŚ, Tomasz PRÓCHNIAK, Iwona ROZEMPOLSKA-RUCIŃSKA

Monensin induces apoptosis in the liver tissue and primary hepatocytes of chicks

Muhammet Yasin TEKELİ, Murat KANBUR

Forage turnip (Brassica rapa) harvested in different phases of vegetative stage and ensiled with the additives of molasses and barley and the effects of additives on silage quality, in vitro digestibility, and energy content

Serhat YILDIZ, Çağrı KALE, Suphi DENİZ, Fatma ÖZKAN

Investigating genetic diversity of Indonesian native cattle breeds using mitochondrial DNA 16S rRNA gene

Cece SUMANTRI, Jakaria JAKARIA, Restu MISRIANTI, Sony Hartono WIJAYA

The in vitro effect of hypochlorous acid-metal nanoparticles combination on Salmonella under different temperature conditions

Karlo MURATOĞLU, Serkan Kemal BÜYÜKÜNAL, Ahmet KOLUMAN