“…However, if this is compared to our implementation on an Ubuntu i7 platform (without the application of the rules that increase robustness), our software acceleration method achieves a latency less than 40% of the lowest latency achieved in [ 1 ]. The face alignment applications ([ 8 , 9 , 10 , 12 ]) based on ERTs [ 14 ] achieve a relatively high speed (between 16 and 45 fps) but they concern different applications such as face recognition, pose estimation, etc., and some of them (e.g., [ 9 ]) align a smaller number of landmarks, which is a faster procedure. The yawning detection approaches [ 30 , 32 ] are based on CNNs and operate at a significantly smaller speed.…”