The accurate and robust detection of the audio has been widely grown as the speech technology in the area of audio forensics, speech recognition, and so on. However, in real time, it is a challenge to deal with the massive data arriving from distributed sources. Thus, the study introduces a method that effectively deals with the data from the distributed sources using the map-reduce framework (MRF). The map and reduce function in MRF aim at feature extraction and audio classification. The robust classification using the proposed grasshopper-ride optimization algorithm-based support vector machine (G-ROA-based SVM) uses the features, such as multiple kernel Mel frequency cepstral coefficients, spectral flux, spectral kurtosis, and delta-amplitude modulation spectrogram. The proposed G-ROA is the integration of ROA and grasshopper optimization algorithm in tuning the optimal weights of SVM and also, the kernel function in SVM is modified using the Gaussian radial basis function, Gaussian kernel, and polynomial kernels. The experimentation of the proposed method is done using two datasets, namely TUT sound event 2017 dataset and ESC dataset. TUT sound event 2017 dataset consists of eight audio recordings from a single acoustic scene. ESC dataset consists of three parts and 252,400 recordings. The analysis reveals that the proposed audio classification acquired the maximal accuracy of 0.96, minimal false alarm rate, and false rejection rate of 0.022 and 0.0119, respectively. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.