DI-UMONS: Institutional repository of the University of Mons

2017-05-21 - Article / In a peer-reviewed journal - English - 11 page(s)

Laraba Sohaib, Tilmanne Joëlle, Brahimi Mohammed, Dutoit Thierry, "3D Skeleton-Based Action Recognition by Representing Motion Capture Sequences as 2D-RGB Images" in Computer Animation & Virtual Worlds, 28, 3-4, e1782

  • Publisher: John Wiley & Sons, Inc. - Engineering
  • CREF codes: Engineering sciences (DI2000), Mathematical computer science (DI1160)
  • UMONS research units: Théorie des circuits et Traitement du signal (F105)
  • UMONS institutes: Institut de Recherche en Technologies de l’Information et Sciences de l’Informatique (InforTech), Institut NUMEDIART pour les Technologies des Arts Numériques (Numédiart)

Abstract:

(English) In recent years, 3D skeleton-based action recognition has become a popular approach to action classification, thanks to the development and availability of cheaper depth sensors. State-of-the-art methods generally represent motion sequences as high-dimensional trajectories, which are then aligned with a time-warping technique and used to train a classification model that predicts the classes of new sequences. Despite the success of these techniques in some fields, particularly when the data are captured by a high-precision motion capture system, action classification still lags behind image classification, especially given the recent advances of deep learning in that field. In this paper, we present a new representation of motion sequences (Seq2Im, for sequence to image), which projects motion sequences onto the RGB domain. The 3D coordinates of the joints are mapped to red, green, and blue values, so that action classification becomes an image classification problem to which algorithms from that field can be applied. This representation was tested with basic image classification algorithms (namely support vector machines, k-nearest neighbors, and random forests) as well as convolutional neural networks. Evaluation on standard 3D human action recognition datasets shows the potential of the proposed method, which outperforms most state-of-the-art results.
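
As a rough illustration of the Seq2Im idea described in the abstract, the Python sketch below maps a skeleton sequence (frames x joints x 3 coordinates) to an RGB image by min-max normalizing each coordinate axis to [0, 255], with rows as frames, columns as joints, and x/y/z mapped to red/green/blue. This is only an illustrative sketch under those assumptions; the exact normalization, joint ordering, and image resizing used in the paper may differ.

    import numpy as np
    from PIL import Image

    def seq_to_image(sequence):
        """Map a motion capture sequence of shape (n_frames, n_joints, 3)
        to an RGB image: x -> red, y -> green, z -> blue, with rows as
        frames and columns as joints (illustrative normalization only)."""
        seq = np.asarray(sequence, dtype=np.float64)
        rgb = np.empty_like(seq)
        for c in range(3):  # normalize x, y and z independently to [0, 255]
            channel = seq[:, :, c]
            lo, hi = channel.min(), channel.max()
            rgb[:, :, c] = 255.0 * (channel - lo) / (hi - lo + 1e-12)
        return Image.fromarray(rgb.astype(np.uint8), mode="RGB")

    # Example: a random 60-frame, 20-joint sequence becomes a 60x20 RGB image
    # that an image classifier (SVM, k-NN, random forest, CNN) could consume.
    img = seq_to_image(np.random.randn(60, 20, 3))
    img.save("seq2im_example.png")

In practice, such images would typically be resized to a fixed resolution before being fed to an image classifier.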

Identifiers:
  • DOI : 10.1002/cav.1782

Keywords:
  • (English) 3D data representation
  • (English) action recognition
  • (English) convolutional neural networks
  • (English) motion capture