期刊名称:International Journal of Signal Processing, Image Processing and Pattern Recognition
印刷版ISSN:2005-4254
出版年度:2012
卷号:5
期号:2
出版社:SERSC
摘要:In this paper, we propose a method for character type classification based on a probabilistic topic model. The topic model is originally developed for topic discovery in text analysis using bag-of-words representation. Recent studies have shown the model is also useful for image analysis. We adopt the probabilistic topic model for character type classification. In our method, character type classification is carried out by classifying image patches based on their topic proportions. Since the performance of the method depends on a visual vocabulary generated by image feature extraction, we compare several feature extraction and description methods, and examine the relations to classification performance. In addition, by extending the method, we propose a coarse-to-fine approach to achieve stable character type classification for a small image patch. For that purpose, firstly, we partition an image into several patches which contain enough information to estimate the model parameters via EM algorithm. Then, each patch is subdivided into smaller patches. Estimation on the small patch is carried out by MAP-technique with a prior reflecting topic proportion of its parent patch. Through the experiments, we show accurate character type classification is made possible by the probabilistic topic model.
关键词:Character type classification; Probabilistic topic model; pLSA model; Bag-of-;words