A word spotting method for Farsi machine-printed document images

In this paper, a word spotting approach for Farsi printed document images has been presented. The main idea of the paper is the font recognition of Farsi document images and query word modification according to the document image's font before searching. This operation increases the similarity between the query word image and its instances in the document image; therefore, the performance of the word spotting system increases. In the proposed word spotting approach, after the query word modification, the query word image rectangle is searched in the text lines of the document image using XNOR similarity measurement. In order to increase the recall rate, we considered an almost low value as an acceptance/rejection threshold (d) and in order to increase precision rate, we used some other features, e.g., number of holes, ascenders, descenders, and dots. With multilevel matching and considering the mentioned features, the problem of justifying the operation (aligning the text to both the left and right) that occurs during the writing of Farsi documents has been solved. This approach was applied on a computer-made dataset consisting of 440 Farsi printed document images, and a precision rate of 97.5% at a recall rate of 92.1% was obtained. Moreover, when applying this approach on a dataset consisting of 224 Farsi scanned document images, a precision rate of 87.6% at recall rate of 79.3% was obtained.

A word spotting method for Farsi machine-printed document images

In this paper, a word spotting approach for Farsi printed document images has been presented. The main idea of the paper is the font recognition of Farsi document images and query word modification according to the document image's font before searching. This operation increases the similarity between the query word image and its instances in the document image; therefore, the performance of the word spotting system increases. In the proposed word spotting approach, after the query word modification, the query word image rectangle is searched in the text lines of the document image using XNOR similarity measurement. In order to increase the recall rate, we considered an almost low value as an acceptance/rejection threshold (d) and in order to increase precision rate, we used some other features, e.g., number of holes, ascenders, descenders, and dots. With multilevel matching and considering the mentioned features, the problem of justifying the operation (aligning the text to both the left and right) that occurs during the writing of Farsi documents has been solved. This approach was applied on a computer-made dataset consisting of 440 Farsi printed document images, and a precision rate of 97.5% at a recall rate of 92.1% was obtained. Moreover, when applying this approach on a dataset consisting of 224 Farsi scanned document images, a precision rate of 87.6% at recall rate of 79.3% was obtained.

___

  • Y. Pourasad, H. Hassibi, M, Banaeyan, “Persian characters recognition based on spatial matching”, International Review on Computers and Software, Vol. 6, pp. 55–59, 2011.
  • R. Mehran, H. Pirsiavash, F. Razzazi, “A front-end OCR for omni-font Persian/Arabic cursive printed documents”, Proceedings of Digital Image Computing, Techniques and Applications, pp. 385–392, 2005. L. Yue, C.L. Tan, “Chinese word searching in imaged documents”, International Journal of Pattern Recognition and Artificial Intelligence, Vol. 18, pp. 229–246, 2004.
  • H.K. Soo, C.P. Sang, B.J. Chang, S.K. Ji, H.R. Park, S.L. Guee, “Keyword spotting on Korean document images by matching the keyword image”, Proceedings of the 8th International Conference on Asian Digital Libraries: Implementing Strategies and Sharing Experiences, Vol. 3815, pp. 158–166, 2005.
  • S.N. Srihari, H. Srinivasan, C. Huang, S. Shetty, “Spotting words in Latin, Devanagari and Arabic scripts”, Indian Journal of Artificial Intelligence, Vol. 16, pp. 2–9, 2006.
  • S.N. Srihari, H. Srinivasan, P. Babu, C. Bhole, “Spotting words in handwritten Arabic documents”, Proceedings of the SPIE, pp. 606702-1–606702-12, 2006.
  • S.N. Srihari, H. Srinivasan, P. Babu, C. Bhole, “Handwritten Arabic word spotting using the CEDARABIC document analysis system”, Proceedings of Symposium on Document Image Understanding Technology, pp. 123– 132, 2005.
  • R. Saabni, J. El-Sana, “Keyword searching for Arabic handwritten documents”, Proceedings of the 11th International Conference on Frontiers on Handwritten Recognition, pp. 271–277, 2008.
  • A.L. Spitz, “Using character shape codes for word spotting in document images”, Shape, Structure and Pattern Recognition, pp. 22–27, 1994.
  • S. Lu, C.L. Tan, “Retrieval of machine printed Latin documents through word shape coding”, Pattern Recognition, Vol. 41, pp. 1816–1826, 2008.
  • R. Manmatha, C. Han, E.M. Riseman, “Word spotting: a new approach to indexing handwriting”, Proceedings of Computer Vision and Pattern Recognition Conference, pp. 631–637, 1996.
  • R. Manmatha, T.M. Rath, “Indexing of handwritten historical documents - recent progress”, Proceedings of the Symposium on Document Image Understanding Technology, pp. 77–85, 2003.
  • S. Belongie, J. Malik, “Matching with shape contexts”, IEEE Workshop on Content-Based Access of Image and Video Libraries, pp. 20–26, 2000.
  • G.L. Scott, H.C. Languet-Higgins, “An algorithm for associating the features of two patterns”, Proceedings of the Royal Society of London, Vol. B224, pp. 21–26, 1991.
  • T.M. Rath, R. Manmatha, “Word spotting for historical documents”, International Journal of Document Analysis and Recognition, Vol. 9, pp. 139–152, 2007.
  • T. Konidaris, B. Gatos, K. Ntzios, I. Pratikakis, H. Theoridis, S.J. Perantonis, “Keyword-guided word spotting in historical printed documents using synthetic data and user feedback”, International Journal of Document Analysis and Recognition, Vol. 9, pp. 166–177, 2007.
  • R. Manmatha, J. Rothfeder, “A scale space approach for automatically segmenting words from historical handwritten documents”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 27, pp. 1212–1225, 200
  • B. Zhan, S.N. Srihari, C. Huang, “Word image retrieval using binary features”, Document Recognition and Retrieval XI, Proceedings of the SPIE, Vol. 5296, pp. 45–53, 2004.
  • N. Otsu, “A threshold selection method from gray-level histograms”, IEEE Transactions on Systems, Man and Cybernetics, Vol. 9, pp. 62–66, 1979.
  • H. Khosravi, E. Kabir, “Farsi font recognition based on Sobel-Roberts features”, Pattern Recognition Letters, Vol. 31, pp. 75–82, 2010.
  • Y. Pourasad, H. Hassibi, M. Banaeyan, “Farsi font recognition based on spatial matching”, 18th International Conference on Systems, Signals, and Image Processing, pp. 1–4, 2011.