Learning discriminative key poses for action recognition

Li Liu, Ling Shao, Xiantong Zhen, Xuelong Li

Research output: Contribution to journalArticlepeer-review

84 Citations (Scopus)


In this paper, we present a new approach for human action recognition based on key-pose selection and representation. Poses in video frames are described by the proposed extensive pyramidal features (EPFs), which include the Gabor, Gaussian, and wavelet pyramids. These features are able to encode the orientation, intensity, and contour information and therefore provide an informative representation of human poses. Due to the fact that not all poses in a sequence are discriminative and representative, we further utilize the AdaBoost algorithm to learn a subset of discriminative poses. Given the boosted poses for each video sequence, a new classifier named weighted local naive Bayes nearest neighbor is proposed for the final action classification, which is demonstrated to be more accurate and robust than other classifiers, e.g., support vector machine (SVM) and naive Bayes nearest neighbor. The proposed method is systematically evaluated on the KTH data set, the Weizmann data set, the multiview IXMAS data set, and the challenging HMDB51 data set. Experimental results manifest that our method outperforms the state-of-the-art techniques in terms of recognition rate.
Original languageEnglish
Pages (from-to)1860-1870
Number of pages11
JournalIEEE Transactions on Cybernetics
Issue number6
Early online date11 Jan 2013
Publication statusPublished - 1 Dec 2013

Cite this