Real-Time Marker-Less Multi-person 3D Pose Estimation in RGB-Depth Camera Networks

Carraro, Marco; Munaro, Matteo; Burke, Jeff; Menegatti, Emanuele

doi:10.1007/978-3-030-01370-7_42

Cited by 24 publications

(28 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For all the six sequences, we computed the same performance metrics (average joint displacement error and standard deviation) on the estimates provided by other state-of-the-art approaches using as input exactly the same data from all the four available Kinects. The other comparison methods have been: (i) OpenPose [14] enriched with the data association and depth inference algorithms, (ii) moving average filtering (MAF), a common baseline approach already described in other similar state-of-the-art works such as [3], [19], and (iii) the standard version of OpenPTrack [3]. The obtained results are reported in Table I.…”

Section: Experiments and Resultsmentioning

confidence: 99%

“…However, the employment of a single sensor limits the reliability of the estimates, due to the fact that they are generally affected by occlusions and field-of-view limitations. A common solution seems to be connecting several cameras to form a common network [3], [20]. One of the biggest challenges when exploiting a multiple-camera network consists in the methodology used to merge information from different sensors.…”

Section: Related Workmentioning

confidence: 99%

“…Although a variety of different solutions to estimate 3D body poses from a single RGB-D sensor exists, in this work we used the one described in [3] which extends the singleview approach described in [14]. Despite our work not being constrained by the specific run-time single-view detection approach, the rationale behind our choice stands in its general applicability since its performances are independent from both the number of people to be tracked and the movements they perform.…”

Section: System Overviewmentioning

confidence: 99%

“…where w ≥ 1 ∈ R. 3 In this way, the joint-specific threshold is potentially capable of slowly adapting to the changes in joint speed. If the new detection distance for the m-th joint d m,t is larger than the just computed threshold th m,t , the detection is not directly rejected, but the corresponding measurement noise variance, after the joint confidence adaptation, is updated as follow:…”

Section: Outlier Filteringmentioning

confidence: 99%

“…In details, with respect to our previous work [3], this paper introduces four novel elements: (i) a new implementation of the Kalman filter considering in its state all the joint positions and velocities of the skeleton model (Sec. III-B), (ii) a joint confidence feedback to adjust the variance of the measurement noise process of the Kalman filter according to the confidence level associated to each single-view detection (Sec.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Real-time Tracking-by-Detection of Human Motion in RGB-D Camera Networks

Malaguti

Carraro

Guidolin

et al. 2019

2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Self Cite

View full text Add to dashboard Cite

This paper presents a novel real-time tracking system capable of improving body pose estimation algorithms in distributed camera networks. The first stage of our approach introduces a linear Kalman filter operating at the body joints level, used to fuse single-view body poses coming from different detection nodes of the network and to ensure temporal consistency between them. The second stage, instead, refines the Kalman filter estimates by fitting a hierarchical model of the human body having constrained link sizes in order to ensure the physical consistency of the tracking. The effectiveness of the proposed approach is demonstrated through a broad experimental validation, performed on a set of sequences whose ground truth references are generated by a commercial markerbased motion capture system. The obtained results show how the proposed system outperforms the considered state-of-the-art approaches, granting accurate and reliable estimates. Moreover, the developed methodology constrains neither the number of persons to track, nor the number, position, synchronization, frame-rate, and manufacturer of the RGB-D cameras used. Finally, the real-time performances of the system are of paramount importance for a large number of real-world applications.1

show abstract

Section: Experiments and Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: System Overviewmentioning

confidence: 99%

Section: Outlier Filteringmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Real-time Tracking-by-Detection of Human Motion in RGB-D Camera Networks

Malaguti

Carraro

Guidolin

et al. 2019

2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Self Cite

View full text Add to dashboard Cite

show abstract

Recognition and Localisation of Pointing Gestures Using a RGB-D Camera

Dhingra

Valli

Kunz

2020

Communications in Computer and Information Science

View full text Add to dashboard Cite

Non-verbal communication is part of our regular conversation, and multiple gestures are used to exchange information. Among those gestures, pointing is the most important one. If such gestures cannot be perceived by other team members, e.g. by blind and visually impaired people (BVIP), they lack important information and can hardly participate in a lively workflow. Thus, this paper describes a system for detecting such pointing gestures to provide input for suitable output modalities to BVIP. Our system employs an RGB-D camera to recognize the pointing gestures performed by the users. The system also locates the target of pointing e.g. on a common workspace. We evaluated the system by conducting a user study with 26 users. The results show that the system has a success rate of 89.59 and 79.92 % for a 2 × 3 matrix using the left and right arm respectively, and 73.57 and 68.99 % for 3 × 4 matrix using the left and right arm respectively.

show abstract

Pointing Gesture Based User Interaction of Tool Supported Brainstorming Meetings

Dhingra

Koutny

Günther

et al. 2020

Lecture Notes in Computer Science

View full text Add to dashboard Cite

This paper presents a brainstorming tool combined with pointing gestures to improve the brainstorming meeting experience for blind and visually impaired people (BVIP). In brainstorming meetings, BVIPs are not able to participate in the conversation as well as sighted users because of the unavailability of supporting tools for understanding the explicit and implicit meaning of the non-verbal communication (NVC). Therefore, the proposed system assists BVIP in interpreting pointing gestures which play an important role in non-verbal communication. Our system will help BVIP to access the contents of a Metaplan card, a team member in the brainstorming meeting is referring to by pointing. The prototype of our system shows that targets on the screen a user is pointing at can be detected with 80% accuracy.

show abstract

Real-Time Marker-Less Multi-person 3D Pose Estimation in RGB-Depth Camera Networks

Cited by 24 publications

References 27 publications

Real-time Tracking-by-Detection of Human Motion in RGB-D Camera Networks

Real-time Tracking-by-Detection of Human Motion in RGB-D Camera Networks

Recognition and Localisation of Pointing Gestures Using a RGB-D Camera

Pointing Gesture Based User Interaction of Tool Supported Brainstorming Meetings

Contact Info

Product

Resources

About