A new Morse code scheme optimized according to the statistical properties of Turkish
Morse code has been in use for more than 180 years, although its current form differs slightly from the one defined by Morse and Vail. The code book constructed by Vail was optimized according to the statistical properties of English. In this study, we propose a new code book optimized for Turkish and demonstrate that it is information-theoretically possible to achieve an improvement of about 10% in the coding of Turkish texts with our proposal. The outcomes of this study might serve as a basis for potential (academic and/or applied) Turkish language-specific lossless data compression studies.
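As a rough illustration of what optimizing a code book for a language's statistics can mean, the sketch below assigns the shortest dot/dash codewords to the most frequent letters and computes the expected codeword duration. It is a minimal sketch under stated assumptions, not the construction used in the study: the function names are hypothetical, the letter frequencies in the usage note are toy values rather than measured Turkish statistics, and durations follow the standard Morse timing convention (a dash lasts three dot units, with a one-unit gap between elements of the same letter).

```python
from itertools import product


def duration(codeword: str) -> int:
    """Duration in dot units: dot = 1, dash = 3, plus a 1-unit gap between elements."""
    elements = sum(1 if c == "." else 3 for c in codeword)
    return elements + (len(codeword) - 1)


def candidate_codewords(max_len: int) -> list[str]:
    """All dot/dash strings of length 1..max_len, shortest duration first."""
    words = [
        "".join(p)
        for n in range(1, max_len + 1)
        for p in product(".-", repeat=n)
    ]
    # Ties in duration are broken lexicographically so the result is deterministic.
    return sorted(words, key=lambda w: (duration(w), w))


def build_code_book(freqs: dict[str, float], max_len: int = 4) -> dict[str, str]:
    """Pair the most frequent symbols with the shortest-duration codewords."""
    symbols = sorted(freqs, key=lambda s: (-freqs[s], s))
    words = candidate_codewords(max_len)
    if len(symbols) > len(words):
        raise ValueError("not enough codewords; increase max_len")
    return dict(zip(symbols, words))


def expected_duration(freqs: dict[str, float], book: dict[str, str]) -> float:
    """Frequency-weighted mean codeword duration, in dot units per symbol."""
    total = sum(freqs.values())
    return sum(freqs[s] * duration(book[s]) for s in freqs) / total


if __name__ == "__main__":
    # Toy frequencies for illustration only (NOT measured Turkish statistics).
    toy_freqs = {"a": 0.5, "e": 0.3, "k": 0.2}
    book = build_code_book(toy_freqs)
    print(book)
    print(expected_duration(toy_freqs, book))
```

For a fixed set of codewords, pairing higher frequencies with shorter durations minimizes the weighted mean, which is why a simple sort suffices here. Gaps between letters and words are ignored because they add the same cost regardless of which codeword a letter receives. Comparing two code books on the same frequency table with `expected_duration` gives a relative improvement figure of the kind quoted in the abstract.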