2020 International Symposium on Autonomous Systems (ISAS) 2020
DOI: 10.1109/isas49493.2020.9378877
|View full text |Cite
|
Sign up to set email alerts
|

Long Short-term Memory Network Based Fatigue Detection with Sequential Mouth Feature

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…It enhances detection feature stability and achieves a precision of 94.44% by employing intermediate frame filtering to prevent duplicate detections. To better accommodate individual differences and enhance the model's generalization ability, Fei et al [5] proposed a fatigue detection system based on sequential mouth features with a long short-term memory (LSTM) network. To mitigate the impact of lighting conditions and facial expression changes, Histogram of Oriented Gradient (HOG) features and a Support Vector Machine (SVM) classifier are utilized for face detection.…”
Section: Single-feature Detection Methodsmentioning
confidence: 99%
“…It enhances detection feature stability and achieves a precision of 94.44% by employing intermediate frame filtering to prevent duplicate detections. To better accommodate individual differences and enhance the model's generalization ability, Fei et al [5] proposed a fatigue detection system based on sequential mouth features with a long short-term memory (LSTM) network. To mitigate the impact of lighting conditions and facial expression changes, Histogram of Oriented Gradient (HOG) features and a Support Vector Machine (SVM) classifier are utilized for face detection.…”
Section: Single-feature Detection Methodsmentioning
confidence: 99%
“…The CNN extracts spatial features from individual frames, and the LSTM network analyzes the temporal features of driver actions between the adjacent frames. For example, some authors attempted to employ this CNN-LSTM network to build a DDD system depending only on the analysis of yawning features [19][20][21]. Most of them used a public dataset named YawDD.…”
Section: Spatiotemporal-based Systemsmentioning
confidence: 99%
“…One of the main differences between these studies is the type of inputs to the models. Xie et al [19] use the whole frames as inputs, Zhang et al [20] use frame edges, and Fei et al [21] focus on the extracted mouth regions from the drivers' faces. Other researchers utilized the CNN-LSTM network to learn the spatiotemporal feature of drivers' eyes.…”
Section: Spatiotemporal-based Systemsmentioning
confidence: 99%
See 2 more Smart Citations