We present a high-speed single-mirror double-pass coincident schlieren system and corresponding algorithms for the visualization of acoustic waves and recovery of their associated audio signals. Schlieren systems are extensively used to visualize strong shockwaves, such as those from supersonic motion or explosions. Recently, they have also been used to visualize lower amplitude non-linear acoustic phenomena, such as the weak shockwaves arising from impact events including hand claps, belt snaps, and towel cracks. Time-invariant sounds produced by loudspeakers have also been imaged, in one case leading to frequency analysis, although these have been limited to high-frequency signals at very high sound pressure levels. The research presented here shifts the focus from sound-field visualization towards audio signal recovery. A comprehensive exploration of several parameters for imaging sound sources, including frequency, wave form, and amplitude, is presented. In addition, we address for the first time the recovery of phase information, which would be essential for speech intelligibility, and the more general case of non-contact sound field reconstruction. Through image and signal processing, it is shown that audio signals can be recovered from high-speed schlieren video whose acoustic waves appear to be below the limit of visibility, and were previously deemed unrecoverable by virtue of their frequency and sound pressure level. This includes sounds at frequencies and loudnesses relevant for human hearing, producing the first 'schlieren microphone'.