首页    期刊浏览 2025年03月03日 星期一
登录注册

文章基本信息

  • 标题:Methods of Arabic Language Baseline Detection ? The State of Art
  • 作者:Atallah AL-Shatnawi ; Khairuddin Omar
  • 期刊名称:International Journal of Computer Science and Network Security
  • 印刷版ISSN:1738-7906
  • 出版年度:2008
  • 卷号:8
  • 期号:10
  • 页码:137-143
  • 出版社:International Journal of Computer Science and Network Security
  • 摘要:Preprocessing is the most important stage in the Arabic OCR system; it has a direct effect on the reliability and efficiency of the segmentation and feature extraction stages. It is worth mentioning that Arabic language is cursively written, and its characters have between 2 to 4 shapes. An Arabic word likely consists of two or more characters which are connected through an imaginary line called baseline. Detecting baseline is one of the main majorities in preprocessing Arabic OCR system. The baseline can be used for both skew normalization and character segmentation. This paper aims to provide a comprehensive review of the methods proposed by researchers to detect Arabic baseline. The Arabic baseline detection methods are categorized into four methods: (a) based on horizontal projection methods, (b) based on word skeleton method, (c) based on contour tracing method, and (d) based on principle component analysis method. Each of these methods has its own advantages and drawbacks.
  • 关键词:Preprocessing; OCR; Handwritten; Offline; Arabic Baseline
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有