Training a CNN to robustly segment the human body parts in range image sequences

By National Research Council of Canada

DOI	https://doi.org/10.1117/12.2508903
Author	Seoud, Lama¹; Boisvert, Jonathan¹; Drouin, Marc-Antoine¹; Picard, Michel¹; Godin, Guy¹
Affiliation	National Research Council of Canada. Digital Technologies
Format	Text, Article
Conference	Optical Data Science II, Feb. 6th, 2019, San Francisco, California
Subject	image segmentation; sensors; image sensors; cameras; imaging systems; 3D modeling; protection systems
Abstract	Range sensors have drawn much interest for human-activity-related research since they provide explicit 3D information about the shape that is invariant to clothing, skin color and illumination changes. However, triangulation-based systems like structured-light sensors generate occlusions in the image when parts of the scene cannot be seen by both the projector and the camera. Those occlusions, as well as missing data points and measurement noise, depend on the structured-light system design. These artifacts add a level of difficulty to the task of human body segmentation that is typically not addressed in the literature. In this work, we design a segmentation model that is able to reason about 3D spatial information and to identify the different body parts in motion, and that is robust to artifacts inherent to the structured-light system, such as triangulation occlusions, noise and missing data. First, we build the first realistic sensor-specific training set by closely simulating the actual acquisition scenario with the same intrinsic parameters as our sensor and the artifacts it generates. Second, we adapt a state-of-the-art fully convolutional network to range images of the human body in order for it to transfer its learning toward 3D spatial information instead of light intensities. Third, we quantitatively demonstrate the importance of simulating sensor-specific artifacts in the training set to improve the robustness of the segmentation of actual range images. Finally, we show the capability of the model to accurately segment human body parts on real range image sequences acquired by our structured-light sensor, with high inter-frame consistency and in real time.
Publication date	2019-03-07
Publisher	Society of Photo-Optical Instrumentation Engineers (SPIE)
In	Optical Data Science II (March 7, 2019): 1–12.
Series	Proceedings of SPIE 10937.
Language	English
Peer reviewed	Yes
Record identifier	b49fbf2b-efd1-40b5-9b22-9c067b0c41df
Record created	2021-07-16
Record modified	2021-07-21

Date modified: 2024-07-18