DEEP LEARNING METHOD FOR HANDWRITING RECOGNITION

DEEP LEARNING METHOD FOR HANDWRITING RECOGNITION

The advancement of technology nowadays resulted into documents, such as forms and petitions, being filled out in computer and digital environment. Yet in some cases, documents are still preserved in traditional style, on print. Due to its distinct proportions, however, its storage, sharing and filing has become a complication. The relocation of these written documents to digital environment is therefore of great significance. In this view, this study aims to explore methodologies of digitizing handwritten documents. In this study, the documents converted to image format were pre-processed using image processing methods. These operations include dividing lines of the document into image format, dividing into words which then divided into characters, and finally, a classification operation on the characters. As classification phase, one of the deep learning methods is the Convolution Neural Network method is used in image recognition. The model was trained using the EMNIST dataset, and in the character, dataset created from the documents at hand. The dataset created had a success rate of 87.81%. Characters classified as finishers are sequentially combined and the document is transferred to the computer afterwards.

___

  • [1] B. Engel-Yeger, L. Nagauker-Yanuv, and S. Rosenblum, “Handwriting performance, self-reports, and perceived self-efficacy among children with dysgraphia,” American Journal of Occupational herapy, vol. 63, no. 2, pp. 182-192, 2009.
  • [2] R. Plamondon, and S. N. Srihari, “Online and off-line handwriting recognition: a comprehensive survey,” IEEE Transactions on pattern analysis and machine intelligence, vol. 22, no. 1, pp. 63-84, 2000.
  • [3] S. N. Srihari, “High-performance reading machines,” Proceedings of the IEEE, vol. 80, no. 7, pp. 1120-1132, 1992.
  • [4] G. Seni, R. K. Srihari, and N. Nasrabadi, “Large vocabulary recognition of on-line handwritten cursive words,” IEEE Transactions on pattern analysis and machine intelligence, vol. 18, no. 7, pp. 757-762, 1996.
  • [5] P. P. Roy, A. K. Bhunia, A. Das et al., “Keyword spotting in doctor's handwriting on medical prescriptions,” Expert Systems with Applications, vol. 76, pp. 113-128, 2017.
  • [6] S. N. Srihari, S.-H. Cha, H. Arora et al., “Handwriting identification: Research to study validity of individuality of handwriting and develop computerassisted procedures for comparing handwriting,” Technical Report CEDAR-TR-01-1, 2001.
  • [7] Fanany, M. I. Handwriting recognition on form document using convolutional neural network and support vector machines (CNN-SVM). In 2017 5th international conference on information and communication technology (ICoIC7) (pp. 1-6). IEEE. (2017, May).
  • [8] D. Mahapatra, C. Choudhury, and R. K. Karsh, " Mahapatra, D., Choudhury, C., & Karsh, R. K. Handwritten character recognition using knn and svm based classifier over feature vector from autoencoder. In International Conference on Machine Learning, Image Processing, Network Security and Data Sciences (pp. 304-317). Springer, Singapore. 2020, July.
  • [9] P. Saha, and A. Jaiswal, "Handwriting Recognition Using Active Contour," Artificial Intelligence and Evolutionary Computations in Engineering Systems, pp. 505-514: Springer, 2020.
  • [10] S. A. Gregory Cohen, Jonathan Tapson, and Andre van Schaik, “EMNIST: an extension of MNIST to handwritten letters,” 2017.
  • [11] Baykal, B., Aktaş, T. Ö., & Yildiz, O. “Makine Öğrenmesi Yöntemlerİ İle Otomatik Çevrimdışı İmza Tanıma ve Doğrulama Sistemi”. In 2017 International Artificial Intelligence and Data Processing Symposium (IDAP) (pp. 1-5). IEEE. 2017, September.
  • [12] NIST, “The EMNIST Dataset,” 2017, April.
  • [13] Sun, B., Yang, L., Zhang, W., Dong, P., Young, C., Dong, J., & Lin, M. Demonstration of applications in computer vision and nlp on ultra power-efficient cnn domain specific accelerator with 9.3 tops/watt. In 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) (pp. 611-611). IEEE. (2019, July).
  • [14] Akhtyamova, L., Ignatov, A., & Cardiff, J. A Largescale CNN ensemble for medication safety analysis. In International Conference on Applications of Natural Language to Information Systems (pp. 247-253). Springer, Cham. 2017, June.
  • [15] M. Cho, J. Ha, C. Park et al., “Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition,” Journal of Biomedical Informatics, vol. 103, pp. 103381, 2020.
  • [16] Momeni, A., Thibault, M., & Gevaert, O. Dropoutenabled ensemble learning for multi-scale biomedical data. In International MICCAI Brainlesion Workshop (pp. 407-415). Springer, Cham. 2018, September.
  • [17] M. Amin-Naji, A. Aghagolzadeh, and M. Ezoji, “Ensemble of CNN for multi-focus image fusion,”Information fusion, vol. 51, pp. 201-214, 2019.
  • [18] C. Tian, Y. Xu, and W. Zuo, “Image denoising using deep CNN with batch renormalization,” Neural Networks, vol. 121, pp. 461-473, 2020.
  • [19] Z. Mushtaq, S.-F. Su, and Q.-V. Tran, “Spectral images based environmental sound classification using CNN with meaningful data augmentation,” Applied Acoustics, vol. 172, pp. 107581, 2020.
  • [20] Y. Su, K. Zhang, J. Wang et al., “Environment sound classification using a two-stream CNN based on decision-level fusion,” Sensors, vol. 19, no. 7, pp. 1733, 2019.
  • [21] T. Ergin, “Convolutional Neural Network (ConvNet yada CNN) nedir, nasıl çalışır? ,” 2018,October