文章基本信息

标题：An Intelligent Model for Visual Scene Analysis and Compression
本地全文：下载
作者：Amjad Rehman ; Tanzila Saba
期刊名称：The International Arab Journal of Information Technology
印刷版ISSN：1683-3198
出版年度：2013
卷号：10
期号：2
出版社：Zarqa Private University
摘要：This paper presents an improved approach for indicating visually salient regions of an image based upon a known visual search task. The proposed approach employs a robust model of instantaneous visual attention (i.e. “bottom-up”) combined with a pixel probability map derived from the automatic detection of a previously-seen object (task-dependent i.e. (“top-down”). The objects to be recognized are parameterized quickly in advance by a viewpoint-invariant spatial distribution of Speeded Up Robust Features (SURF) interest-points. The bottom-up and top-down object probability images are fused to produce a task-dependent saliency map. The proposed approach is validated using observer eye-tracker data collected under object search-and-count tasking. Proposed approach shows 13% higher overlap with true attention areas under task compared to bottom-up saliency alone. The new combined saliency map is further used to develop a new intelligent compression technique which is an extension of Discrete Cosine Transform (DCT) encoding. The proposed approach is demonstrated on surveillance-style footage throughout.
关键词：Visualization; discrete cosine transform; image compression; scene analysis.