This study investigates the effects of audio processing and filtering strategies to enhance the performance of speech recognition systems in noisy environments. The focus is on the Short-Time Fourier Transform (STFT) operations applied to noisy audio files and noise reduction procedures. While STFT operations form the basis for detecting noise and analyzing the speech signal in the frequency domain, noise reduction steps involve threshold-based masking and convolution operations. The results obtained demonstrate a significant improvement in speech recognition accuracy in noisy environments through audio processing and filtering strategies. A detailed analysis of the graphs provides guidance for evaluating the effectiveness of noise reduction procedures and serves as a roadmap for future research. This study emphasizes the critical importance of audio processing and filtering strategies in improving the performance of speech recognition systems in noisy environments, laying a foundation for future studies.