A Novel Framework for Text Recognition in Street View Images

This paper addresses a new text recognition solution, which is mainly used for the detection of street view images. This paper employs two different approaches to detect text-based regions and recognise corresponding text fields. The first approach utilises maximally stable extremal regions (MSER), whereas the second approach relies on the class specific extremal regions (CSER) algorithm. Two separate frameworks, designed with respect to the aforementioned methods, are applied to the street view images so as to extract textbased regions. Numerous experiments were performed to evaluate and compare both approaches. Results obtained from the CSER-based approach are especially quite encouraging and verify the system’s ability to detect text-based regions and recognise corresponding text fields

___

[1] R. D. Brown, “Example-based machine translation in the pangloss system,” in Proceedings of the 16th International Conference on Computational Linguistics, Copenhagen, DK, 1996, pp. 169-174.

[2] Y. Cui and Q. Huang, “Character Extraction of License Plates from Video,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, San Juan, USA, 1997, pp. 502-507.

[3] J. Gao and J. Yang, “An Adaptive Algorithm for Text Detection from Natural Scenes," in Proceedings of Computer Vision and Pattern Recognition, 2001, (CVPR 2001).

[4] J. W. Hutchins, “Machine Translation: Past, Present,” Future, Ellis Horwood Limited, England, 1986.

[5] A. K. Jain and B. Yu, “Automatic text location in images and video frames,” Pattern Recognition, vol. 31, no. 12, pp. 2055-2076, 1998.

[6] K. S. Lahari, “Text Detection from Natural Image using MSER and BOW,”IJEERT, vol.3, pp. 152-156, 2015.

[7] Q. Ye and D. Doermann, “Text Detection and Recognition in Imagery: A Survey,” IEEE Transcactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 7, pp. 1480-1500, 2015.

[8] K. Wadhawan and E. Gajendran, “Automatic Recognition of Text in Images: A Survey”, International Journal of Computer Applications, vol. 127, no. 15, pp. 15-19, 2015.

[9] K. Elagouni, C. Garcia, and P. Sbillot, “A comprehensive neuralbased approach for text recognition in videos using natural language processing,” in Proc. ACM Conf. Multimedia Retrieval, 2011

[10] C. Yao, X. Zhang, X. Bail, W. Liu, Y. Ma and Z. Tu, “Detecting texts of arbitrary orientations in nautral images,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Rec., pp. 1083-1090, 2012.

[11] W. Wu, D. Chen, and J. Yang, “Integrating co-training and recognition for text detection, “in Proc. IEEE Int. Conf. Multimedia Expo, pp. 1169-1169, 2005.

[12] L. Tang and J. R. Kender, “A unified text extraction method for instructional videos, “ in Proc. IEEE Int. Conf. Image Process, pp. 1216-1219, 2005.

[13] T. Wang et al., “End-to-end text recognition with CNN, “ in Proc. IEE Int. Conf. Pattern Recognition, pp 3304-3308, 2012.

[14] A. Bissacco et al., “PhotoOCR: Reading Text in Uncontrolled Conditions, “ IEEE International Conferenceo n Computer Vision, Sydney, NSW, 2013, pp. 785-792, 2013.

[15] S. Yousfi et al., “Deep Learning and recurrent connectionist-based approaches for Arabic text recognition in videos,” 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, pp. 1026-1030, 2015.

[16] G. Nagaraju et al., “Text Extraction From Images With EdgeEnhanced MSER And Hardware Interfacing Using Arduino”, IJECS, vol.4, pp 11798-11803, 2015.

[17] B. Epshtein et al., “Detecting text in natural scenes with stroke width transform,” in CVPR, CA, USA, pp. 2963-2970, 2010.

[18] P. Shivakumara et al., “A Laplacian approach to multi-oriented text detection in video,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 33, pp. 412-419, 2011.

[19] L. Neumann et al.,“Real-Time Scene Text Localization and Recognition,” in 25th IEEE Conference on Computer Vision and Patter Rec. , RI, USA, 16-22 June, 2012.

[20] G. Li et al., “Scene text detection with extremal region based cascade filtering, “ in IEEE ICP Conference, Phoenix , AZ, USA, pp. 2896-2900, 25-28 Sept, 2016.

[21] J. Mataset et al., “A new class of learnable detectors for categorization, “In Image Analysis, vol. 3540 of LNCS, pp. 541- 550, 2005.

[22] K. R. Muller et al.,” An introduction to kernel-based learning algorithms, “ IEEE Trans. on NN., vol. 12, pp. 181-201, 2001.