Combining heterogeneous features for 3D hand-held object recognition

Xiong Lv; Shuang Wang; Xiangyang Li; Shuqiang Jiang

doi:10.1117/12.2071381

29 October 2014 Combining heterogenous features for 3D hand-held object recognition

Xiong Lv, Shuang Wang, Xiangyang Li, Shuqiang Jiang

Proceedings Volume 9273, Optoelectronic Imaging and Multimedia Technology III; 92732I (2014) https://doi.org/10.1117/12.2071381
Event: SPIE/COS Photonics Asia, 2014, Beijing, China

Abstract

Object recognition has wide applications in the area of human-machine interaction and multimedia retrieval. However, due to the problem of visual polysemous and concept polymorphism, it is still a great challenge to obtain reliable recognition result for the 2D images. Recently, with the emergence and easy availability of RGB-D equipment such as Kinect, this challenge could be relieved because the depth channel could bring more information. A very special and important case of object recognition is hand-held object recognition, as hand is a straight and natural way for both human-human interaction and human-machine interaction. In this paper, we study the problem of 3D object recognition by combining heterogenous features with different modalities and extraction techniques. For hand-craft feature, although it reserves the low-level information such as shape and color, it has shown weakness in representing hiconvolutionalgh-level semantic information compared with the automatic learned feature, especially deep feature. Deep feature has shown its great advantages in large scale dataset recognition but is not always robust to rotation or scale variance compared with hand-craft feature. In this paper, we propose a method to combine hand-craft point cloud features and deep learned features in RGB and depth channle. First, hand-held object segmentation is implemented by using depth cues and human skeleton information. Second, we combine the extracted hetegerogenous 3D features in different stages using linear concatenation and multiple kernel learning (MKL). Then a training model is used to recognize 3D handheld objects. Experimental results validate the effectiveness and gerneralization ability of the proposed method.

Citation Download Citation

Xiong Lv, Shuang Wang, Xiangyang Li, and Shuqiang Jiang "Combining heterogenous features for 3D hand-held object recognition", Proc. SPIE 9273, Optoelectronic Imaging and Multimedia Technology III, 92732I (29 October 2014); https://doi.org/10.1117/12.2071381

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available