Enhanced computer vision with Microsoft Kinect sensor: A review

Jungong Han, Ling Shao, Dong Xu, Jamie Shotton

Research output: Contribution to journalArticlepeer-review

1155 Citations (Scopus)

Abstract

With the invention of the low-cost Microsoft Kinect sensor, high-resolution depth and visual (RGB) sensing has become available for widespread use. The complementary nature of the depth and visual information provided by the Kinect sensor opens up new opportunities to solve fundamental problems in computer vision. This paper presents a comprehensive review of recent Kinect-based computer vision algorithms and applications. The reviewed approaches are classified according to the type of vision problems that can be addressed or enhanced by means of the Kinect sensor. The covered topics include preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping. For each category of methods, we outline their main algorithmic contributions and summarize their advantages/differences compared to their RGB counterparts. Finally, we give an overview of the challenges in this field and future research trends. This paper is expected to serve as a tutorial and source of references for Kinect-based computer vision researchers.
Original languageEnglish
Pages (from-to)1318-1334
Number of pages17
JournalIEEE Transactions on Cybernetics
Volume43
Issue number5
Early online date25 Jun 2013
DOIs
Publication statusPublished - 1 Oct 2013

Cite this