Hyperspectral remote sensing images with high spatial resolution (H 2 imagery) have an abundant spatial-spectral information, holding tremendous potential for remote sensing fine-grained monitoring and classification. However, challenges such as high spatial heterogeneity, severe intra-class spectral variability and poor signal-to-noise ratio especially in unmanned aerial vehicle (UAV) hyperspectral imagery constrain and hinder the performance of fine-grained classification. Convolutional neural network (CNN) emerges as a formidable and excellent tool for image mining and feature extraction, offering effective utility for land cover classification. In this paper, a parallel convolutional classification network model based on multi-modal filters (including ICA-2D-FPN and SA-3D-CNN branching structures) PCCN-MSS is proposed for precise H 2 imagery classification. The ICA-2D-FPN branch integrates independent component analysis (ICA) into 2D-CNN to extract the multispatial scale and spectral information of H 2 imagery by Feature Pyramid Networks (FPN), meanwhile the SA-3D-CNN branch is designed to extract the spatial and spectral information by combining spectral attention (SA) mechanism and 3D-CNN. Taking hyperspectral imagery of unmanned aerial vehicles containing vegetation and artifactual material ground as an example, the proposed PCCN-MSS model achieves an overall accuracy (OA) of 78.18%, which outperforms by 9.58% than the compared methods. The proposed PCCN-MSS method can mitigate the classification issues of severe salt-and-pepper noise and inaccurate boundary, delivering more satisfactory classification results with robust classification performance and remarkable advantages for H 2 imagery.