首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Spell Checker for OCR
  • 本地全文:下载
  • 作者:Yogomaya Mohapatra ; Ashis Kumar Mishra ; Anil Kumar Mishra
  • 期刊名称:International Journal of Computer Science and Information Technologies
  • 电子版ISSN:0975-9646
  • 出版年度:2013
  • 卷号:4
  • 期号:1
  • 页码:91-97
  • 出版社:TechScience Publications
  • 摘要:the implementation focuses a systematic approach to the design the Spell Checker for OCR. In this a spelling correction system, is designed specifically for OCR-generated text, that selects candidate words through the information gathered from multiple knowledge sources and automatically replaces with the correct word. This system for text correction based on approximate string matching, which uses a statistical model that incorporates techniques like Confusion Matrix and N-gram Analysis. The ability to accurately recognize characters by scanning hard copy images is extremely important for many forms of automated data processing and has wide application. A great deal of effort has been devoted to correcting errors which invariably result from commercially available OCR devices. Besides error patterns like substitution, transposition, insertion and deletion, emphasis is given on modifiers and their positions with respect to the consonants and conjuncts being modified. The system is developed using file management system through java and java Swing for the Windows operating system.
  • 关键词:the implementation focuses a systematic approach;to the design the Spell Checker for OCR. In this a spelling;correction system; is designed specifically for OCR-generated;text; that selects candidate words through the information;gathered from multiple knowledge sources and automatically;replaces with the correct word. This system for text correction;based on approximate string matching; which uses a statistical;model that incorporates techniques like Confusion Matrix and;N-gram Analysis. The ability to accurately recognize;characters by scanning hard copy images is extremely;important for many forms of automated data processing and;has wide application. A great deal of effort has been devoted to;correcting errors which invariably result from commercially;available OCR devices. Besides error patterns like;substitution; transposition; insertion and deletion; emphasis is;given on modifiers and their positions with respect to the;consonants and conjuncts being modified. The system is;developed using file management system through java and;java Swing for the Windows operating system.
国家哲学社会科学文献中心版权所有