Detection of Vocal Cyst Problem by Using High Order Moments and Support Vector Machines

Detection of Vocal Cyst Problem by Using High Order Moments and Support Vector Machines

The voice disorders occurring due to the problems in the voice producing organs cause some changes in the intensity or tone of the voice. It is difficult to identify the diseased voice by the reason of its variable and different nature. One of the most popular voice disorder reasons is cyst which is located on the vocal cords. The aim of this study is to detect the vocal cyst problem by using acoustic voices data which were recorded from healthy people and patient with cyst diagnoses subjects by using high order statistics and support vector machines (SVMs) classifier. In this study, two experimental procedures were implemented for two different voice samples. In the first, /a/ vowel and the second, the Turkish word of “aydınlık” (mean in English “bright”) were investigated with skewness and kurtosis parameters which are third and fourth order cumulants (spectral moments), respectively. The obtained features values for healthy and cyst subjects were used as the SVMs’ inputs for classification. The experimental results show that the test accuracies of SVMs were found as 94.89% and 91.11% for /a/ vowel and “aydınlık” word, respectively. It is concluded from experimental studies that skewness provides more meaningful results than kurtosis in relation to distinguish into two voice groups as healthy and cyst. Additionally, it is assessed that the “aydınlık” is affective word for the pathological and normal acoustic voice discrimination as good as /a/vowel.

___

  • [1]. Lions Voice Clinic. University of Minnesota, Department of Otolaryngology, P.O. Box 487, 420 Delaware St., SE, Minneapolis, MN 55455, USA. URL:http://www.lionsvoiceclinic.umn.edu (Access time: October 2017)
  • [2].R.H.G. Martins, H.A. Amaral, E.L.M. Tavares, M.G. Martins, T.M. Gonçalves, N.H. Dias, “Voice disorders: Etilogy and diagnosis”, Journal of Voice, 30(6):761-769, (2016).
  • [3].M.A. Kılıç, “Ses problem olan hastanın objektif ve subjektif yöntemlerle değerlendirilmesi”, CurrPract ORL, 6(2):257-265, (2010).
  • [4].A. Lovato, W. De Colle, L. Giacomelli, A. Piacente, L. Righetto, G. Marioni, C. Filippis, “Multi-dimensional voice program (MDVP) vs Praat for assessing euphonic subjects: A preliminary study on the gender-discriminating power of acoustic analysis software”, Journal of Voice, 30(6):765e1- 765e5, (2016).
  • [5].M.O. Rosa, J.C. Pereira, M. Grellet, and A.C.P.L.F. Carvalho, “Signal processing and statistical procedures to identify laryngeal pathologies”, IEEE International Conference on Electronics, Circuits, and Systems, 1: 423- 426, (1999).
  • [6].M.K. Arjmandi, M. Pooyan, M. Mikaili, M. Vali, A. Moqarehzadeh, “Identification of voice disorders using long-time features and support vector machine with different feature reduction methods”, Journal of Voice, 25(6), e275-e289, (2011).
  • [7].S.W. Sonu, R.K. Sharma, “Disease detection using analysis of voice parameters”, International Journal of Computing Science and Communication Technologies, 4(2), (2012).
  • [8].M.A. Kılıç, “Ses bozuklukları: Yeni bir sınıflandırma sistemi”, Ç. Ü. Tıp Fakültesi Arşiv Kaynak Tarama Dergisi, 8(3):321-337, (1999).
  • [9].M. Gerçeker, İ. Yorulmaz, A. Ural, “Ses ve konuşma”, K.B.B. ve Baş Boyun Cerrahisi Dergisi, 8(1):71-78, (2000).
  • [10]. O. Türk, Ö. Şayli, A.S. Özsoy, L.M. Arslan, “Türkçe’de ünlülerin formant analizi”, Proceedings of the 18th National Conference in Turkish Linguistics, Ankara, Turkey, (2004).
  • [11]. X. Wang, J. Zhang, Y. Yan, “Discrimination between pathological and normal voices using GMM-SVM approach”, Journal of Voice, 25(1):38-43, (2011).
  • [12]. S. Hadjitodorov, P. Mitev, “A computer system for acoustic analysis of pathological voices and laryngeal diseases screening”, Med. Eng. Phys., 24:419–429, (2002).
  • [13]. J.D. Arias-Londono, J.I. Godino-Llorente, N. SaenzLechon, V. Osma-Ruiz, G. Castellanos-Dominguez, “An improved method for voice pathology detection by means of a HMM-based feature space transformation”, Pattern Recognition, 9(43):3100-3112, (2010).
  • [14]. I.A. Mahmoud, S.A. Hanaa, “Wavelet-based Mel frequency cepstral coefficient for speaker identification using hidden Markov models”, Journal of Telecommunications, 1 (2):16-21, (2010).
  • [15]. M.K. Arjmandi, M. Pooyan, “An optimum algorithm in pathological voice quality assessment using waveletpacket-based features, linear discriminant analysis and support vector machine”, Biomedical Signal Processing and Control, 7:3-19, (2012).
  • [16]. Ö. Eskidere, Ö. Aktaş, C. Ünal, “Voice disorders identification using discrete wavelet based features”, Medical Technologies National Conference (TIPTEKNO), 15-18 Oct., Bodrum, Turkey, (2015).
  • [17]. J.B. Alonso, F. Díaz-de-Maria, C.M. Trivieso, M.A. Ferrer, “Using nonlinear features for voice disorder detection,” 3rd Int. Conf. Nonlinear Speech Process, Barcelona, Spain, Apr. pp 94-106, (2005).
  • [18]. G. Muhammad, G. Altuwaijri, M. Alsulaiman, Z. Ali, T.A. Mesallam, M. Farahat, K.H. Malki, A. Al-nasheri, “Automatic voice pathology detection and classification using vocal tract area irregularity”, Biocybernetics and Biomedical Engineering, 36, 309 – 317, (2016).
  • [19]. B. Porat, B. Friendlander, “Direction finding algorithms based on high-order statistics”, IEEE Transaction on Signal Processing, 39(9):2016-2024, (1991).
  • [20]. K. Tanner, N. Roy, A. Ash, E.H. Buder, “Spectral moments of the long-term average spectrum: sensitive indices of voice change after therapy?”, Journal of Voice, 19(2):211-22, (2005).
  • [21]. S.Y. Lowell, R.H. Colton, R.T. Kelley, Y.C. Hahn, “Spectral- and cepstral-based measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker”, Journal of Voice, 25(5) :223-232, (2011).
  • [22]. E. Nemer, R. Goubran, S. Mahmoud, “Robust voice activity detection using higher-order statistics in the LPC residual domain”, IEEE Transaction on Speech and Audio Processing, 9(3):217-231, (2001).
  • [23]. E. Malkoç, “Türkçe ünlü formant frekans değerleri ve bu değerlere dayalı ünlü dörtgeni”, Dil Dergisi, 146:71-85, (2009).
  • [24]. K. Pearson, “Contribution to the mathematical theory of evolution, II. Skew variation in homogeneous material”, Philosophical Transactions of the Royal Society of London, 91:343-414, (1895).
  • [25]. A.M. Fiori, M. Zenga, “Karl Pearson and the origin of kurtosis”, International Statistical Review, 77(1):40- 50, (2009).
  • [26]. V. Vapnik, “Statistical learning theory”, John Wiley&Sons, New York, (1998).