“…Generally, LDVs are based on the principle of laser interferometry, which makes them highly sensitive to reflections from the object surface, environmental factors, and the relative positions of the projection laser and the detection interferometer modules [9]. Recently, an emerging technology, image-based sound recovery from high-speed videos, has drawn much attention [10, 11, 12, 13, 14, 15, 16, 17]. In these systems, a highly developed phase-based algorithm is applied to extract sound from high-speed videos that capture subtle motions [11].…”