Identification of the English Accent Spoken in Different Countries by the k-Nearest Neighbor Method

Sound is the pressure wave created by an object vibrating with a certain frequency. 3 organs are needed for the formation of voice in humans. These are lungs, vocal cords and mouth. Due to the structure of these organs and the similarity of the person with their current language, they can speak another language with different accent. A language can be spoken in different parts of the same country and in different countries. The second most widely used language in the world is English, has numerous accents around the world. In this study, it is aimed to determine which country the English accent spoken in different regions belongs to. In the dataset used, there are 330 sound samples including English accents spoken in Spain, France, Germany, Italy, England and America. Classification has been made with 12 features obtained by Mel Frequency Cepstrum Coefficients feature extraction method. k-Nearest Neighbor (kNN) were used in the classification and 87.2% success was achieved.

___

[1] Cai, Z.G., et al., Accent modulates access to word meaning: Evidence for a speaker-model account of spoken word recognition. Cognitive Psychology, 2017. 98: p. 73-101.

[2] Dunton, J., C. Bruce, and C. Newton, Investigating the impact of unfamiliar speaker accent on auditorycomprehension in adults with aphasia. International Journal of Language & Communication Disorders, 2015: p. 1-11.

[3] Ikeno, A. and J.H. Hansen, The effect of listener accent background on accent perception and comprehension. EURASIP Journal on Audio, Speech, and Music Processing, 2007. 2007(1): p. 076030.

[4] Asher, J.J. and R. García, The optimal age to learn a foreign language. The Modern Language Journal, 1969. 53(5): p. 334-341.

[5] Gupta, V. and P. Mermelstein, Effects of speaker accent on the performance of a speaker‐independent, isolated‐word recognizer. The Journal of the Acoustical Society of America, 1982. 71(6): p. 1581-1587.

[6] Pedersen, C. and J. Diederich. Accent classification using support vector machines. in 6th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007). 2007. IEEE.

[7] Gaikwad, S., B. Gawali, and K. Kale, Accent recognition for indian english using acoustic feature approach. International Journal of Computer Applications, 2013. 63(7).

[8] Mannepalli, K., P.N. Sastry, and M. Suman, MFCC-GMM based accent recognition system for Telugu speech signals. International Journal of Speech Technology, 2016. 19(1): p. 87-93.

[9] Hanani, A., M.J. Russell, and M.J. Carey, Human and computer recognition of regional accents and ethnic groups from British English speech. Computer Speech & Language, 2013. 27(1): p. 59-74.

[10] Nazzi, T., P.W. Jusczyk, and E.K. Johnson, Language discrimination by English-learning 5-month-olds: Effects of rhythm and familiarity. Journal of Memory and Language, 2000. 43(1): p. 1-19.

[11] Ma, Y., et al. Speaker accent recognition through statistical descriptors of Mel-bands spectral energy and neural network model. in 2012 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT). 2012. IEEE.

[12] Ma, Z. and E. Fokoué, A comparison of classifiers in performing speaker accent recognition using MFCCs. arXiv preprint arXiv:1501.07866, 2015.

[13] Koklu, M. and I.A. Ozkan, Multiclass classification of dry beans using computer vision and machine learning techniques. Computers and Electronics in Agriculture, 2020. 174: p. 105507.

[14] Hossin, M. and M. Sulaiman, A review on evaluation metrics for data classification evaluations. International Journal of Data Mining & Knowledge Management Process, 2015. 5(2): p. 1.

[15] Kannan, R. and V. Vasanthi, Machine learning algorithms with ROC curve for predicting and diagnosing the heart disease, in Soft Computing and Medical Bioinformatics. 2019, Springer. p. 63-72.

[16] Liao, S., et al., Multi-object intergroup gesture recognition combined with fusion feature and KNN algorithm. Journal of Intelligent & Fuzzy Systems, 2020(Preprint): p. 1-11.