2021
DOI: 10.1109/access.2021.3109815
|View full text |Cite
|
Sign up to set email alerts
|

TML: A Triple-Wise Multi-Task Learning Framework for Distracted Driver Recognition

Abstract: We propose a multi-task learning framework for improving the performance of vision-based deep-learning approaches for driver distraction recognition. The most popular tool so far for solving this task is convolutional neural networks (CNNs) that have proven to be strongly biased toward local features. Such bias causes CNNs to neglect global structural information, adversely affecting the robustness of the distracted driver recognition task. To solve this problem, we generate positive and negative samples of ea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 53 publications
0
4
0
Order By: Relevance
“…This paper examines different deep learning classification models for distracted driver classification [55][56][57][58][59] and proposes a model that improves performance and provides recommendations. We explored the performance of different models: ResNet50, VGG16, MobileNet, and Inception.…”
Section: Discussionmentioning
confidence: 99%
“…This paper examines different deep learning classification models for distracted driver classification [55][56][57][58][59] and proposes a model that improves performance and provides recommendations. We explored the performance of different models: ResNet50, VGG16, MobileNet, and Inception.…”
Section: Discussionmentioning
confidence: 99%
“…[24] proposed a coarse temporal attention network by exploiting spatiotemporal attention to model driver activity, utilizing an attention mechanism to generate high-level action-specific contextual information. [40] adopted a multitask learning approach and constructed triplets of images to improve the performance of vision-based driver distraction recognition. The image triplets were used to force networks to explore global information.…”
Section: Related Workmentioning
confidence: 99%
“…Jegham et al [57] proposed a novel LSTM-DCNN complex model that can process consecutive frames and different views to detect distracteddriver behaviors. Liu et al [58] proposed a multitask learning framework for recognizing driver distraction that combines raw image, positive sample, and negative sample to acquire more features for discriminating different behaviors. MobileVGG [59] is a state-of-the-art classifier that uses separable convolution to accomplish accurate and real-time classification.…”
Section: Introductionmentioning
confidence: 99%