Learning a deep predictive coding network for a semi-supervised 3D-hand pose estimation

Bulugu, Isack; Banzi, Jamal; Huang, Shiliang; Ye, Zhongfu

COSTECH Repository Home
→
University of Dar-es-salaam
→
College of Information and Communication Technology
→
Department of Electronics and Telecommunications Engineering
→
View Item

dc.creator	Bulugu, Isack
dc.creator	Banzi, Jamal
dc.creator	Huang, Shiliang
dc.creator	Ye, Zhongfu
dc.date	2020-04-07T05:18:22Z
dc.date	2020-04-07T05:18:22Z
dc.date	2020-03-27
dc.date.accessioned	2021-05-03T13:17:02Z
dc.date.available	2021-05-03T13:17:02Z
dc.identifier	IEEE
dc.identifier	2329-9274
dc.identifier	http://hdl.handle.net/20.500.11810/5411
dc.identifier	10.1109/JAS.2020.1003090
dc.identifier.uri	http://hdl.handle.net/20.500.11810/5411
dc.description	In this paper we present a CNN based approach for a real time 3D-hand pose estimation from the depth sequence. Prior discriminative approaches have achieved remarkable success but are facing two main challenges: Firstly, the methods are fully supervised hence require large numbers of annotated training data to extract the dynamic information from a hand representation. Secondly, unreliable hand detectors based on strong assumptions or a weak detector which often fail in several situations like complex environment and multiple hands. In contrast to these methods, this paper presents an approach that can be considered as semi-supervised by performing predictive coding of image sequences of hand poses in order to capture latent features underlying a given image without supervision. The hand is modelled using a novel latent tree dependency model ( LDTM ) which transforms internal joint location to an explicit representation. Then the modeled hand topology is integrated with the pose estimator using data dependent method to jointly learn latent variables of the posterior pose appearance and the pose configuration respectively. Finally, an unsupervised error term which is a part of the recurrent architecture ensures smooth estimations of the final pose. Experiments on three challenging public datasets, ICVL, MSRA, and NYU demonstrate the significant performance of the proposed method which is comparable or better than state-of-the-art approaches.
dc.publisher	IEEE
dc.subject	Hand pose estimation; Convolutional neural networks; Recurrent neural networks; Human-machine interaction; semi supervised learning
dc.title	Learning a deep predictive coding network for a semi-supervised 3D-hand pose estimation
dc.type	Journal Article, Peer Reviewed

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Department of Electronics and Telecommunications Engineering [156]

Show simple item record

Search COSTECH

Advanced Search

Browse

All of COSTECH
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

Learning a deep predictive coding network for a semi-supervised 3D-hand pose estimation

Files in this item

This item appears in the following Collection(s)

Search COSTECH

Browse

All of COSTECH

This Collection

My Account