FİGEN ERTAŞ, Ömer ESKİDERE

Yazılım tabanlı sözcük sentezleyici

Formant sentezleme tekniğine dayanan bir Türkçe sözcük sentezleyici geliştirilmiştir. Sentezleyici normalde kaskat/paralel modunda çalışmasına rağmen, sadece bir anahtar yardımı ile, alternatif olarak gerektiğinde paralel modda kullanılabilir. Kullandığımız sözcük sentezleyicinin en önemli özelliği önceden kaydedilmiş konuşma örneklerine ihtiyaç duymadan doğrudan ses yolu modeli ile yapay insan sesi üretmesidir. Her bir ses, 20'si değişken ve 19'u sabit olmak üzere 39 parametre ile karakterize edilmiştir. Programdaki formant frekansları, formant band genişlikleri, temel frekans, vb. gibi değişken kontrol parametreleri kullanıcı tarafından belirlenir. Bu projedeki sabit parametreler belirli bir erkek sesi için uygun olarak seçilmiştir, farklı erkek veya kadın sesleri parametrelerde bazı değişiklikler yapılarak elde edilebilir. Yeterli hafıza ve donanıma sahip kişisel bir bilgisayar ortamında çalışabilen esnek bir yazılım tabanlı sentezleyici tanıtılmıştır. Sentezleyici ile elde edilen değişik kelimelerin, yapay konuşmaya alışkın olmayan eğitilmemiş kişiler ile gerçeklenen anlaşılabilirlik testi göstermiştir ki, ünlü harfler sesiz harflere göre daha doğru olarak belirlenmişlerdir.

Software based speech synthesiser

A software for implementing a Turkish speech synthesiser has been developed which is based on formant synthesis, method. The synthesiser is normally used in a cascade/parallel configuration, or alternatively in a parallel configuration depending on a single switch. Most important feature of the formant synthesiser used in this project is that the algorithm does not use previously recorded speech sounds, but rather generates synthetic speech using human vocal tract model. Each of speech sounds is characterized with 39 parameters in the software formant synthesizer. 20 of 39 parameters are variable and the others are constant. A control program lets the user specify variable control parameter data such as formant frequencies, formant bandwidths, fundamental frequency, etc. The constant parameters in this project have been used values appropriate for a particular male voice, and would have to adjusted slightly to approximate the speech of other male or female talkers. A flexible software synthesizer that can run on any personal computer having sufficient core and peripheral equipment has been described. Intelligibility tests of different words produced by synthesiser indicate that untrained people who are unfamiliar with synthetic speech identify vowels correctly better than consonants.