AN INSIGHT INTO THE CODING PROCESS IN THE NATURAL LANGUAGES IN TERMS OF THE INFORMATION THEORY (IN THE EXAMPLE OF TURKISH)

This study examines the transformation that morphemes undergo in the coding process. This transformation is tied to use: morphemes, which are marked according to their articulation, evolve according to their frequency in messages. The study proposes a method for coding Turkish morphemes according to their information value. The method is based on the information theory developed by C. E. Shannon, which treats "meaning" as a measurable concept, and it relates the information value of the symbols in a message to their frequency in messages. Accordingly, the relation between the frequency of morphemes and their code numbers is illustrated with several examples. The information value of several Turkish words is measured from their frequency, and the code number needed to encode each morpheme is determined from these values. This requires a corpus in which each text is separated into morphemes and the frequency of each morpheme is determined. For this stage, the study uses data that the author prepared for a previous study by analyzing 100 text pieces, chosen specifically from texts published in the last decade. In addition, the entropy of the Turkish morphemes is calculated from their frequency in the corpus, and the relationship between the number of letters in the alphabet and the number of signs required to mark the words is presented. Finally, the method preferred in the study is opened to discussion. The choice between the measurement method that is required and the one that is possible is significant enough to call for a questioning of information theory itself.
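The frequency-based measurement described above can be illustrated with a minimal Python sketch. It is not the author's implementation. It assumes that the information value of a morpheme is the Shannon self-information, -log2(p), of its relative frequency p, and that the "code number" means the smallest whole number of symbols, drawn from an alphabet of a given size, that can carry that much information. The function names and the toy morpheme list are hypothetical and are not taken from the study's corpus.

```python
import math
from collections import Counter


def information_values(morphemes):
    """Self-information in bits, -log2(p), for each distinct morpheme."""
    counts = Counter(morphemes)
    total = sum(counts.values())
    return {m: -math.log2(c / total) for m, c in counts.items()}


def entropy(morphemes):
    """Shannon entropy in bits per morpheme: -sum(p * log2(p))."""
    counts = Counter(morphemes)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def code_length(info_bits, alphabet_size):
    """Assumed reading of 'code number': the smallest whole number of
    symbols from an alphabet of the given size that carries info_bits."""
    return max(1, math.ceil(info_bits / math.log2(alphabet_size)))


# Toy, hypothetical sample of segmented morphemes (not the study's data).
sample = ["ev", "ler", "de", "ev", "i", "ev", "ler", "ev"]

info = information_values(sample)
for morpheme, bits in sorted(info.items(), key=lambda kv: kv[1]):
    # Compare a binary code with a code over a 29-letter alphabet.
    print(morpheme, bits, code_length(bits, 2), code_length(bits, 29))
print("entropy:", entropy(sample))
```

Under these assumptions, the more frequent a morpheme is, the lower its information value and the shorter its code, and a larger alphabet shortens every code because each sign carries more bits.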

___

Abramson, N. (1953). Information Theory and Coding. McGraw-Hill Book Company Press.

Barthes, R. (1979). Göstergebilimin İlkeleri. (B. Vardar and M. Rifat, Trans.). Ministry of Culture Press.

Diri, B. (1999). Türkçenin Biçimbilim Yapısına Dayalı Bir Metin Sıkıştırma Sistemi [Unpublished doctoral dissertation]. Yildiz Technical University.

Faulmann, C. (2018). Yazı Kitabı. (I. Arda, Trans.). Türkiye İş Bankası Cultural Press.

Gemalmaz, E. (1982). Standart Türkiye Türkçesinin Formanlarının Enformatif Değerleri ve Bu Değerlerin İhtiyaç Hâlinde Bu Dilin Gelişimine Muhtemel Etkileri. http://efrasiyap.tripod.com/yazilar/STT1.pdf

Gülensoy, T. (2007). Türkiye Türkçesindeki Türkçe Sözcüklerin Köken Bilgisi Sözlüğü. Türk Dil Kurumu Press.

Hartley, R. V. L. (1928). "Transmission of information." The Bell System Technical Journal, 7, 535-563. http://telecomlaw.ru/young_res/trans_inform_1928.pdf

Ifrah, G. (1998). Rakamların Evrensel Tarihi. (K. Dinçer, Trans.). TÜBİTAK Press.

İmer, K., Kocaman, A. & Özsoy, A. (2011). Dilbilim Sözlüğü. Boğaziçi University Press.

Korkmaz, Z. (1960). "Türk Dilinde +ça Eki ve Bu Ek ile Yapılan İsim Teşkilleri Üzerine Bir Deneme." Ankara Üniversitesi Dil ve Tarih Coğrafya Fakültesi Dergisi, 17, 275-358.

Lévi-Strauss, C. (2013). Mit ve Anlam. (G. Y. Demir, Trans.). Ithaki Press.

Martinet, A. (1998). İşlevsel Genel Dilbilim. (B. Vardar, Trans.). Multilingual Press.

Piaget, J. (1982). Yapısalcılık. (F. Akadlı, Trans.). Dost Kitabevi Press.

Saussure, F. (1998). Genel Dilbilim Dersleri. (B. Vardar, Trans.). Multilingual Press.

Sedor, K. (2015). The Law of Large Numbers and its Applications. Lakehead University Press.

Sertkaya, O. F. (2012). "Etimoloji Nedir - Ne Değildir ve İsimden İsim Yapan +ay/+ey Eki Üzerine." Ankara Üniversitesi Dil ve Tarih Coğrafya Fakültesi Türkoloji Dergisi, 19, 43-72.

Shannon, C. E. (1948). "A Mathematical Theory of Communication." The Bell System Technical Journal, 27, 379-423. http://people.math.harvard.edu/~ctm/home/text/others/shannon/entropy/entropy.pdf

Şeka, Y. V. (2011). "Türkiye Türkçesinde Çokanlamlılık ve Eşgösterenlilik." (R. Adzhumerova and E. Atmaca, Trans.). The Journal of Language Researches, 9, 165-172.

Yılmaz, Ç. (2018). Türkçede Anlam Birimlerinin Bilgi Kuramı Temelinde İşaretlenmesi. Gazi Kitabevi Press. https://github.com/COMU/zemberek-extension#giriş