A multimodal detection system with complementary capabilities was developed for efficient impurity detection. The system consists of a visible light camera, a multispectral camera, and image correction and registration algorithms. It acquires spectral and color features simultaneously and offers higher spatial resolution than the multispectral camera alone. To verify its efficiency, the system was applied to the detection of impurities in Pu’er tea. The spectral and color features of each pixel in the Pu’er tea images were obtained by the system and used for pixel classification. The experimental results showed that a support vector machine (SVM) model based on the combined features achieved an accuracy of 93%, which was 7% higher than that of the model based on spectral features only. By applying a median filtering algorithm and a contour detection algorithm to the label matrix extracted from the pixel-classified images, eight impurities were detected successfully, with hair the only exception. Moreover, because the visible light camera has high resolution, small impurities could be imaged clearly. By comparing the segmented color image with the pixel-classified image, small impurities such as hair could also be detected successfully. Overall, the results demonstrated that the system can acquire multiple images that give a more detailed and comprehensive view of the inspected items, and that it has an excellent ability to detect small impurities.
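The post-processing step applied to the label matrix can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a binary label matrix (1 for impurity, 0 for tea), a 3x3 median window with edge replication at the borders, and it substitutes 4-connected region bounding boxes for the contour detection algorithm. The function names `median_filter3` and `find_regions` are hypothetical.

```python
from collections import deque


def median_filter3(mask):
    """Apply a 3x3 median filter to a binary mask (assumed window size).

    Border pixels are handled by replicating the nearest edge pixel.
    """
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            window = []
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr = min(max(r + dr, 0), h - 1)
                    cc = min(max(c + dc, 0), w - 1)
                    window.append(mask[rr][cc])
            window.sort()
            out[r][c] = window[4]  # middle of the 9 sorted values
    return out


def find_regions(mask):
    """Return bounding boxes (top, left, bottom, right) of 4-connected
    regions of 1s. This stands in for contour detection in this sketch."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if mask[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first search over the region that contains (r, c).
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Hypothetical 7x7 label matrix: one 3x3 impurity blob plus one isolated
# misclassified pixel at row 5, column 5.
demo_mask = [[0] * 7 for _ in range(7)]
for y in range(1, 4):
    for x in range(1, 4):
        demo_mask[y][x] = 1
demo_mask[5][5] = 1

print(find_regions(demo_mask))                  # regions before filtering
print(find_regions(median_filter3(demo_mask)))  # regions after filtering
```

In this toy case the median filter removes the isolated pixel while the larger blob survives. The same behavior suggests why thin objects such as hair, which are only a few pixels wide, may be lost at this stage and call for the comparison with the higher-resolution color image.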