Journal of University of Science and Technology of China ›› 2018, Vol. 48 ›› Issue (4): 322-330.DOI: 10.3969/j.issn.0253-2778.2018.04.008

• Original Paper • Previous Articles     Next Articles

Cross-media semantic retrieval with deep canonical correlation analysis

WANG Shu, SHI Zhongzhi   

  1. 1. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190;
  • Received:2017-06-01 Revised:2017-07-14 Online:2018-04-30 Published:2018-04-30

Abstract: The cross-media retrieval with canonical correlation analysis (CCA) is a method to map different media features to the largest correlation isomorphism subspace through the canonical correlation analysis, and compare the similarity between cross-media data in the subspace. However CCA is a linear model and can not adequately exploit the complex correlation between cross-media data. The structure of the traditional deep canonical correlation analysis (DCCA) is improved, and the latent dirichlet allocation (LDA) is used to discover the semantic information in the text data and learns the semantic mapping. The cross-media correlation learning with deep canonical correlation analysis (CMC-DCCA) and the cross-media semantic correlation retrieval (CMSCR) are proposed. Experiments on the Wikipedia text image dataset shows that the CMC-DCCA model can mine the complex correlation between cross-media data better, and that CMSCR has better performance in cross-media retrieval.

Key words: canonical correlation analysis, deep canonical correlation analysis, semantic mapping, cross-media retrieval

CLC Number: