Adaptive Histogram Analysis for Scene Text Binarization and Recognition

Authors

  • M. Basavanna Post Graduate Department of Computer Science Government College (Autonomous), Mandya Karnataka-India
  • P. Shivakumara Faculty of Computer Science and Information Technology, University of Malaya
  • S. K. Srivatsa St. Joseph College of Engineering Chennai-Tamil Nadu-India
  • G. Hemantha Kumar Department of Studies in Computer Science University of Mysore, Mysore Karnataka-India

DOI:

https://doi.org/10.22452/mjcs.vol29no2.1

Keywords:

Adaptive histogram analysis, Scene text binarization, Scene text recognition, Region growing, Word segmentation, Global thresholding

Abstract

Scene text binarization and recognition is a challenging task due to different appearance of text in clutter background and uneven illumination in natural scene images. In this paper, we present a new method based on adaptive histogram analysis for each sliding window over a word of a text line detected by the text detection method. The histogram analysis works on the basis that intensity values of text pixels in each sliding window have uniform color. The method segments the words based on region growing which studies spacing between words and characters. Then we propose to use existing OCRs such as ABBYY and Tesseract (Google) to recognize the text line at word and character levels to validate the binarization results. The method is compared with well-known global thresholding technique of binarization to show its effectiveness.

Downloads

Download data is not yet available.

Downloads

Published

2016-06-01

How to Cite

Basavanna, M., Shivakumara, P., Srivatsa, S. K., & Kumar, G. H. (2016). Adaptive Histogram Analysis for Scene Text Binarization and Recognition. Malaysian Journal of Computer Science, 29(2), 74–85. https://doi.org/10.22452/mjcs.vol29no2.1