Abstract
Conventional action recognition algorithms adopt a single type of feature or a simple concatenation of multiple features. In this paper, we propose to better fuse and embed different feature representations for action recognition using a novel spectral coding algorithm called Kernelized Multiview Projection (KMP). Computing the kernel matrices from different features/views via time-sequential distance learning, KMP can encode different features with different weights to achieve a low-dimensional and semantically meaningful subspace where the distribution of each view is sufficiently smooth and discriminative. More crucially, KMP is linear for the reproducing kernel Hilbert space, which allows it to be competent for various practical applications. We demonstrate KMP’s performance for action recognition on five popular action datasets and the results are consistently superior to state-of-the-art techniques.
Original language | English |
---|---|
Pages (from-to) | 115-129 |
Number of pages | 15 |
Journal | International Journal of Computer Vision |
Volume | 118 |
Issue number | 2 |
Early online date | 5 Oct 2015 |
DOIs | |
Publication status | Published - Jun 2016 |
Keywords
- Human action recognition
- Sequential distance learning
- Multiple view fusion
- Dimensionality reduction
- Spectral coding