期刊名称:International Journal of Computer Science and Information Technologies
电子版ISSN:0975-9646
出版年度:2013
卷号:4
期号:1
页码:91-97
出版社:TechScience Publications
摘要:the implementation focuses a systematic approach to the design the Spell Checker for OCR. In this a spelling correction system, is designed specifically for OCR-generated text, that selects candidate words through the information gathered from multiple knowledge sources and automatically replaces with the correct word. This system for text correction based on approximate string matching, which uses a statistical model that incorporates techniques like Confusion Matrix and N-gram Analysis. The ability to accurately recognize characters by scanning hard copy images is extremely important for many forms of automated data processing and has wide application. A great deal of effort has been devoted to correcting errors which invariably result from commercially available OCR devices. Besides error patterns like substitution, transposition, insertion and deletion, emphasis is given on modifiers and their positions with respect to the consonants and conjuncts being modified. The system is developed using file management system through java and java Swing for the Windows operating system.
关键词:the implementation focuses a systematic approach;to the design the Spell Checker for OCR. In this a spelling;correction system; is designed specifically for OCR-generated;text; that selects candidate words through the information;gathered from multiple knowledge sources and automatically;replaces with the correct word. This system for text correction;based on approximate string matching; which uses a statistical;model that incorporates techniques like Confusion Matrix and;N-gram Analysis. The ability to accurately recognize;characters by scanning hard copy images is extremely;important for many forms of automated data processing and;has wide application. A great deal of effort has been devoted to;correcting errors which invariably result from commercially;available OCR devices. Besides error patterns like;substitution; transposition; insertion and deletion; emphasis is;given on modifiers and their positions with respect to the;consonants and conjuncts being modified. The system is;developed using file management system through java and;java Swing for the Windows operating system.