Multi-source remote sensing images have the characteristics of large differences in texture and gray level. Mismatch and low recognition accuracy are easy to occur in the process of identifying targets. Thus, in this paper, the target recognition algorithm of multi-source remote sensing image based on IoT vision is investigated. The infrared sensor and SAR radars are set in the visual perception layer of the iVIOT. The visual perception layer transmits the collected remote sensing image information to the application layer through the wireless networks. The data processing module in the application layer uses the normalized central moment idea to extract the features of multi-source remote sensing image. Contourlet two-level decomposition is performed on the image after feature extraction to realize multi-scale and multi-directional feature fusion. A two-step method of primary fineness is used to match the fused features and the random sampling consensus algorithm is used to eliminate false matches for obtaining the correct match pairs. After the image feature matching is completed, the BVM target detection operator is used to complete the target recognition of multi-source remote sensing image. Experimental results show that the use of the IoT to visually recognizing the desired remote sensing image target has low communication overhead, and the recognition reaches 99% accuracy.