Auditory analysis is an essential method that is used to recognize voice identity in court investigations. However, noise will interfere with auditory perception. Based on this, we selected white noise, pink noise, and speech noise in order to design and conduct voice identity perception experiments. Meanwhile, we explored the impact of the noise type and frequency distribution on voice identity perception. The experimental results show the following: (1) in high signal-to-noise ratio (SNR) environments, there is no significant difference in the impact of noise types on voice identity perception; (2) in low SNR environments, the perceived result of speech noise is significantly different from that of white noise and pink noise, and the interference is more obvious; (3) in the speech noise with a low SNR (−8 dB), the voice information contained in the high-frequency band of 2930~6250 Hz is helpful for achieving accuracy in voice identity perception. These results show that voice identity perception in a better voice transmission environment is mainly based on the acoustic information provided by the low-frequency and medium-frequency bands, which concentrate most of the energy of the voice. As the SNR gradually decreases, a human’s auditory mechanism will automatically expand the receiving frequency range to obtain more effective acoustic information from the high-frequency band. Consequently, the high-frequency information ignored in the objective algorithm may be more robust with respect to identity perception in our environment. The experimental studies not only evaluate the quality of the case voice and control the voice recording environment, but also predict the accuracy of voice identity perception under noise interference. This research provides the theoretical basis and data support for applying voice identity perception in forensic science.