首页    期刊浏览 2024年12月11日 星期三
登录注册

文章基本信息

  • 标题:Recognition of Devanagari Scene Text Using Autoencoder CNN
  • 本地全文:下载
  • 作者:Sankirti Sandeep Shiravale ; Jayadevan R ; Sanjeev S Sannakki
  • 期刊名称:ELCVIA: electronic letters on computer vision and image analysis
  • 印刷版ISSN:1577-5097
  • 出版年度:2021
  • 卷号:20
  • 期号:1
  • 页码:55-69
  • DOI:10.5565/rev/elcvia.1344
  • 出版社:Centre de Visió per Computador
  • 摘要:Scene text recognition is a well-rooted research domain covering a diverse application area. Recognition of scene text is challenging due to the complex nature of scene images. Various structural characteristics of the script also influence the recognition process. Text and background segmentation is a mandatory step in the scene text recognition process. A text recognition system produces the most accurate results if the structural and contextual information is preserved by the segmentation technique. Therefore, an attempt is made here to develop a robust foreground/background segmentation(separation) technique that produces the highest recognition results. A ground-truth dataset containing Devanagari scene text images is prepared for the experimentation. An encoder-decoder convolutional neural network model is used for text/background segmentation. The model is trained with Devanagari scene text images for pixel-wise classification of text and background. The segmented text is then recognized using an existing OCR engine (Tesseract). The word and character level recognition rates are computed and compared with other existing segmentation techniques to establish the effectiveness of the proposed technique.
  • 其他摘要:Scene text recognition is a well-rooted research domain covering a diverse application area. Recognition of scene text is challenging due to the complex nature of scene images. Various structural characteristics of the script also influence the recognition process. Text and background segmentation is a mandatory step in the scene text recognition process. A text recognition system produces the most accurate results if the structural and contextual information is preserved by the segmentation technique. Therefore, an attempt is made here to develop a robust foreground/background segmentation(separation) technique that produces the highest recognition results. A ground-truth dataset containing Devanagari scene text images is prepared for the experimentation. An encoder-decoder convolutional neural network model is used for text/background segmentation. The model is trained with Devanagari scene text images for pixel-wise classification of text and background. The segmented text is then recognized using an existing OCR engine (Tesseract). The word and character level recognition rates are computed and compared with other existing segmentation techniques to establish the effectiveness of the proposed technique.
  • 关键词:Character and Text recognition;scene text recognition; Devanagari script; OCR; segmentation technique; encoder-decoder CNN
国家哲学社会科学文献中心版权所有