TOWARDS AUTOMATIC SPEECH RECOGNITION FOR THE TATAR LANGUAGE

TOWARDS AUTOMATIC SPEECH RECOGNITION FOR THE TATAR LANGUAGE

In this paper we describe an approach to create automatic speech recognition systems for the Tatar language. We developed speech analysis platform to work with under-resourced languages and used this tool to create baseline speech recognition system. Additionally, some changes have been made to this language-independent system to take into account specific Tatar morphological structure. The resulting adapted system showed 75% accuracy on testing audio records.

___

  • [1] Lewis, M. Paul, Gary F. Simons, Charles D. Fennig (eds.). “Ethnologue: Languages of the World”, Dallas, Texas: SIL International, 2013.
  • [2] Khusainov A.F. “Automatic phoneme recognition system for the Tatar language”. In: The 1st International Conference “TurkLang”, Astana, 2013, pp 211–217.
  • [3] Young S., Kershaw D., Odell J., Ollason D., Valtchev V., Woodland Ph. The HTK Book [Electronic resource]. URL: http://nesl.ee.ucla.edu/projects/ibadge/docs/AS R/htk/htkbook.pdf.
  • [4] Kurimo M, Puurula A., Arisoy E., Alumae T., Saraclar M.. “Unlimited vocabulary speech recognition for agglutinative languages”. In: HLT-NAACL, NY, USA, 2006, pp 487–494