‘Owl’ and ‘Lizard’: patterns of head pose and eye pose in driver gaze classification

Fridman, Lex; Lee, Joonbum; Reimer, Bryan; Victor, Trent

doi:10.1049/iet-cvi.2015.0296

Cited by 88 publications

(95 citation statements)

References 16 publications

(30 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They can be further divided into 2 categories coarsely, i.e. geometry distribution based methods [4] [39][40][41][42][43] and 3D facial model [5][6][7][8][9][10] [44][45][46][47][48][49] based methods.…”

Section: Related Workmentioning

confidence: 99%

“…By looking for the projection relation between a 3D facial model and a 2D face image, head pose angles can be calculated from the elements in the rotation matrix directly (see Section III for details). Mbouna et al [5], Fridman et al [6] and Tawari et al [7] solved the rotation matrix to estimate the head pose according to a 3D facial model and corresponding 2D facial feature points directly. Bar et al [8] provided some 3D facial templates to match the 3D point cloud obtained from the depth values so as to estimate head poses by using an iterative closest point (ICP) algorithm.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Single image-based head pose estimation with spherical parametrization and 3D morphing

Yuan

Hou

et al. 2020

Pattern Recognition

View full text Add to dashboard Cite

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Single image-based head pose estimation with spherical parametrization and 3D morphing

Yuan

Hou

et al. 2020

Pattern Recognition

View full text Add to dashboard Cite

“…appearance descriptor), which allows for an increased number of gaze zones, but not at the expense of performance, as shown by evaluating on a dataset composed of multiple drivers. Another learning based method is the work presented by Fridman et al [15] where the evaluations are done on a significantly large dataset, but the design of the features to represent the state of the head and eyes are what is causing their classifier to over fit to user based models and to not generalize well with global based models.…”

Section: A On Gaze Estimationmentioning

confidence: 99%

Dynamics of Driver's Gaze: Explorations in Behavior Modeling and Maneuver Prediction

Martin

Vora

Yuen

et al. 2018

IEEE Trans. Intell. Veh.

View full text Add to dashboard Cite

The study and modeling of driver's gaze dynamics is important because, if and how the driver is monitoring the driving environment is vital for driver assistance in manual mode, for take-over requests in highly automated mode and for semantic perception of the surround in fully autonomous mode. We developed a machine vision based framework to classify driver's gaze into context rich zones of interest and model drivers gaze behavior by representing gaze dynamics over a time period using gaze accumulation, glance duration and glance frequencies.As a use case, we explore the driver's gaze dynamic patterns during maneuvers executed in freeway driving, namely, left lane change maneuver, right lane change maneuver and lane keeping. It is shown that condensing gaze dynamics into durations and frequencies leads to recurring patterns based on driver activities. Furthermore, modeling these patterns show predictive powers in maneuver detection up to a few hundred milliseconds a priori.

show abstract

“…Manually annotating specific epochs of driving, as the prior studies have done, is no longer sufficient for understanding the complexities of human behavior in the context of autonomous vehicle technology (i.e., driver glance or body position over thousands of miles of Autopilot use). For example, one of many metrics that are important to understanding driver behavior is momentby-moment detection of glance region [17], [18] (see §I-C). In order to accurately extract this metric from the 2.2 billion frames of face video without the use of computer vision would require an immense investment in manual annotation, assuming the availability of an efficient annotation tool that is specifically designed for the manual glance region annotation task and can leverage distributed, online, crowdsourcing of the annotation task.…”

Section: A Naturalistic Driving Studiesmentioning

confidence: 99%

MIT Advanced Vehicle Technology Study: Large-Scale Naturalistic Driving Study of Driver Behavior and Interaction With Automation

et al. 2019

Self Cite

View full text Add to dashboard Cite

Today, and possibly for a long time to come, the full driving task is too complex an activity to be fully formalized as a sensing-acting robotics system that can be explicitly solved through model-based and learning-based approaches in order to achieve full unconstrained vehicle autonomy. Localization, mapping, scene perception, vehicle control, trajectory optimization, and higher-level planning decisions associated with autonomous vehicle development remain full of open challenges. This is especially true for unconstrained, real-world operation where the margin of allowable error is extremely small and the number of edge-cases is extremely large. Until these problems are solved, human beings will remain an integral part of the driving task, monitoring the AI system as it performs anywhere from just over 0% to just under 100% of the driving. The governing objectives of the MIT Advanced Vehicle Technology (MIT-AVT) study are to (1) undertake large-scale real-world driving data collection that includes high-definition video to fuel the development of deep learning based internal and external perception systems, (2) gain a holistic understanding of how human beings interact with vehicle automation technology by integrating video data with vehicle state data, driver characteristics, mental models, and self-reported experiences with technology, and (3) identify how technology and other factors related to automation adoption and use can be improved in ways that save lives. In pursuing these objectives, we have instrumented 23 Tesla Model S and Model X vehicles, 2 Volvo S90 vehicles, 2 Range Rover Evoque, and 2 Cadillac CT6 vehicles for both long-term (over a year per driver) and medium term (one month per driver) naturalistic driving data collection. Furthermore, we are continually developing new methods for analysis of the massive-scale dataset collected from the instrumented vehicle fleet. The recorded data streams include IMU, GPS, CAN messages, and high-definition video streams of the driver face, the driver cabin, the forward roadway, and the instrument cluster (on select vehicles). The study is on-going and growing. To date, we have 122 participants, 15,610 days of participation, 511,638 miles, and 7.1 billion video frames. This paper presents the design of the study, the data collection hardware, the processing of the data, and the computer vision algorithms currently being used to extract actionable knowledge from the data. 01 231 4523 67 89 8 %& 'ÿ )*+ ,-,.,*/ÿ 0123 45 1 '142-,5 ,67ÿ 8+ *97 :; <=>ÿ @AB; CDÿ ; AE =F; GH ÿIJ KFL; M NM OFB; ÿ =F>DH ÿPQR SPT ULM VGLDH ÿPWW XGCM NY GDH ÿWZ [M Y GDÿ =LM VGBH ÿQPPR SI\ XM =GAÿ ] LF@GDH ÿJPPÿa b b a cd :; <=>ÿ =F; Fÿ NAY Y GN; M ABÿ M Dÿ ABeAM Bef ÿ :; F; M D; M NDÿ

show abstract

‘Owl’ and ‘Lizard’: patterns of head pose and eye pose in driver gaze classification

Cited by 88 publications

References 16 publications

Single image-based head pose estimation with spherical parametrization and 3D morphing

Single image-based head pose estimation with spherical parametrization and 3D morphing

Dynamics of Driver's Gaze: Explorations in Behavior Modeling and Maneuver Prediction

MIT Advanced Vehicle Technology Study: Large-Scale Naturalistic Driving Study of Driver Behavior and Interaction With Automation

Contact Info

Product

Resources

About