
Article Information

  • Title: Pre-Processing Images of Public Signage for OCR Conversion
  • Authors: Amber Khan ; Mariam Nida Usmani ; Nashrah Rahman
  • Journal: Journal of Signal and Information Processing
  • Print ISSN: 2159-4465
  • Online ISSN: 2159-4481
  • Year of publication: 2019
  • Volume: 10
  • Issue: 01
  • Pages: 1-11
  • DOI: 10.4236/jsip.2019.101001
  • Publisher: Scientific Research Publishing
  • Abstract: In this paper, we propose a novel method to enhance the OCR (Optical Character Recognition) readability of public signboards captured by smartphone cameras, both outdoors and indoors, under various lighting conditions. A distinct feature of our technique is the detection of these signs in the HSV (Hue, Saturation and Value) color space, which filters the signboard out from the background so that the textual details of each signboard can be interpreted correctly. The extracted signboard is then binarized using a thresholding technique that is optimized for text printed on contrasting backgrounds, and passed through the Tesseract engine to detect individual characters. We test our technique on a dataset of over 200 images taken in and around the campus of our college, and attain better OCR results than traditional methods. Further, we suggest a method to automatically assign ROIs (Regions Of Interest) to detected signboards, for better recognition of textual information.
  • Keywords: Image Processing; HSV; Binarization; OCR
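
The pipeline outlined in the abstract (HSV-based sign detection, ROI assignment, binarization) can be illustrated with a minimal, standard-library-only sketch. This is not the authors' implementation: the function names (`hsv_mask`, `bounding_box`, `otsu_threshold`, `binarize_roi`), the hue and saturation limits, and the use of Otsu's method as the thresholding step are all assumptions made for illustration, since the record does not name the thresholding technique the paper uses.

```python
import colorsys


def hsv_mask(pixels, hue_range, min_sat=0.4, min_val=0.2):
    """Mark pixels whose HSV value falls inside the given range.

    pixels: 2D list of (r, g, b) tuples with 0-255 channels.
    hue_range: (lo, hi) in [0, 1]; lo > hi means the range wraps (e.g. red).
    The default limits are illustrative assumptions, not values from the paper.
    """
    lo, hi = hue_range
    mask = []
    for row in pixels:
        mask_row = []
        for r, g, b in row:
            h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
            in_hue = lo <= h <= hi if lo <= hi else (h >= lo or h <= hi)
            mask_row.append(in_hue and s >= min_sat and v >= min_val)
        mask.append(mask_row)
    return mask


def bounding_box(mask):
    """Return the inclusive (top, left, bottom, right) box around all True cells."""
    rows = [y for y, row in enumerate(mask) if any(row)]
    cols = [x for row in mask for x, hit in enumerate(row) if hit]
    if not rows:
        return None
    return rows[0], min(cols), rows[-1], max(cols)


def otsu_threshold(gray_values):
    """Otsu's method: pick the threshold that maximizes between-class variance."""
    hist = [0] * 256
    for g in gray_values:
        hist[g] += 1
    total = len(gray_values)
    sum_all = sum(i * c for i, c in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with value <= t
    sum0 = 0  # sum of the values of those pixels
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize_roi(pixels, box):
    """Crop the ROI, convert it to grayscale, and binarize it (1 = above threshold)."""
    top, left, bottom, right = box
    gray = [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row[left:right + 1]]
        for row in pixels[top:bottom + 1]
    ]
    t = otsu_threshold([g for row in gray for g in row])
    return [[1 if g > t else 0 for g in row] for row in gray]
```

In a full pipeline the binarized crop would then be handed to the Tesseract engine for character recognition; that step is left out here because it depends on an external installation. The bounding box is taken over the whole mask, so text pixels inside the sign remain in the ROI even though their own color fails the HSV test.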