Journal of University of Science and Technology of China ›› 2014, Vol. 44 ›› Issue (2): 128-137. DOI: 10.3969/j.issn.0253-2778.2014.02.008

• Original Article •

Compact image representation based on variability analysis

ZHAO Xin

  1. Department of Automation, University of Science and Technology of China, Hefei 230027, China; 2. Center for Research on Intelligent Perception and Computing, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2013-04-16 Revised: 2013-05-20 Online: 2014-02-28 Published: 2014-02-28
  • Corresponding author: TAN Tieniu
  • About the author: ZHAO Xin, male, born in 1984, PhD candidate. Research interests: digital image understanding and analysis, computer vision. E-mail: xzhao@nlpr.ia.ac.cn
  • Funding:
    Supported by the National Key Basic Research Program of China (973 Program) (2012CB316302) and the National Natural Science Foundation of China (61135002, 61175007).

Abstract: Image representation is the most fundamental and important step in image classification. To achieve high classification performance, current image representation methods typically use very high-dimensional feature vectors, which places a heavy burden on classifier training and feature storage. These methods also do not account for the effect of image variations on the representation. To address both problems, a method that models image variability is proposed. First, a Gaussian mixture model (GMM) is used to model the low-level visual features. Then, the sufficient statistics of each image are constructed. Finally, variability analysis is applied to decompose the sufficient statistics, and a compact image representation is obtained in combination with partial least squares regression. On mainstream public image classification datasets, the proposed method achieves higher classification performance while greatly reducing the cost of classifier training and feature storage.

Key words: image representation, image classification, variability analysis, factor analysis, partial least squares