Weakly-supervised cross-domain dictionary learning for visual recognition

Fan Zhu, Ling Shao

Research output: Contribution to journalArticlepeer-review

195 Citations (Scopus)

Abstract

We address the visual categorization problem and present a method that utilizes weakly labeled data from other visual domains as the auxiliary source data for enhancing the original learning system. The proposed method aims to expand the intra-class diversity of original training data through the collaboration with the source data. In order to bring the original target domain data and the auxiliary source domain data into the same feature space, we introduce a weakly-supervised cross-domain dictionary learning method, which learns a reconstructive, discriminative and domain-adaptive dictionary pair and the corresponding classifier parameters without using any prior information. Such a method operates at a high level, and it can be applied to different cross-domain applications. To build up the auxiliary domain data, we manually collect images from Web pages, and select human actions of specific categories from a different dataset. The proposed method is evaluated for human action recognition, image classification and event recognition tasks on the UCF YouTube dataset, the Caltech101/256 datasets and the Kodak dataset, respectively, achieving outstanding results.
Original languageEnglish
Pages (from-to)42-59
Number of pages18
JournalInternational Journal of Computer Vision
Volume109
Issue number1
Early online date12 Mar 2014
DOIs
Publication statusPublished - Aug 2014

Cite this