2019
DOI: 10.1109/tmm.2018.2888798
|View full text |Cite
|
Sign up to set email alerts
|

A Hierarchical Approach for Associating Body-Worn Sensors to Video Regions in Crowded Mingling Scenarios

Abstract: We address the complex problem of associating several wearable devices with the spatio-temporal region of their wearers in video during crowded mingling events using only acceleration and proximity. This is a particularly important first step for multi-sensor behavior analysis using video and wearable technologies, where the privacy of the participants must be maintained. Most state-of-the-art works using these two modalities perform their association manually, which becomes practically unfeasible as the numbe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 32 publications
0
6
0
Order By: Relevance
“…A comparison between the three tested architectures reveals that the best performances are obtained by the fully-conv model when trained using our novel RTL, with an AVG auROC of 76.3%. The full ROC curves for this particular model are presented in Figure 6 c, together with the ROC curves for the baseline method of Shigeta et al [ 38 ] in Figure 6 a and Cabrera-Quiros et al [ 42 ] in Figure 6 b, which only manage to achieve AVG auROCs of 55.6% and 52.1%, respectively. In terms of best AVG auROC, the best model was trained on a combination of Easy and Hard negatives; while adopting harder negatives samples during training may improve the performances for SSSA auROC, the degradation over other scores leads to a lower AVG auROC.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…A comparison between the three tested architectures reveals that the best performances are obtained by the fully-conv model when trained using our novel RTL, with an AVG auROC of 76.3%. The full ROC curves for this particular model are presented in Figure 6 c, together with the ROC curves for the baseline method of Shigeta et al [ 38 ] in Figure 6 a and Cabrera-Quiros et al [ 42 ] in Figure 6 b, which only manage to achieve AVG auROCs of 55.6% and 52.1%, respectively. In terms of best AVG auROC, the best model was trained on a combination of Easy and Hard negatives; while adopting harder negatives samples during training may improve the performances for SSSA auROC, the degradation over other scores leads to a lower AVG auROC.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…The results of these experiments are presented in Figure 8 , which shows that our method outperforms the baselines for every case studied, in spite of a degradation of performances when the number of people increases from 2 to 10. Despite Cabrera et al [ 42 ] being explicitly designed to deal with mingling events and crowded scenes, when their algorithm is applied to the short video clips, their performances drastically drop, with results that are almost on par with random guesses in the hardest scenario (i.e., DSSA, SSSA in Figure 8 c).…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…A crucial assumption made in many former multimodal datasets [1,6,7] is that the association of video data to the wearable modality can be manually performed. Few works [37,38] have tried to address this issue but using movement cues alone to associate the modalities is challenging as conversing individuals are mostly stationary. This remains a significant and open question for future large scale deployable multimodal systems.…”
Section: Conclusion Discussion and Limitationsmentioning
confidence: 99%
“…In practice, it is preferable to avoid this step by using a fully automated multimodal association approach. However this remains an open research challenge [37,38]. During the event, participants mingled freely-they were allowed to carry bags or use mobile phones.…”
Section: Data Association and Participant Protocolmentioning
confidence: 99%