Artificial recognition of tomato diseases is often time-consuming, laborious and subjective. For tomato disease images, it is difficult to find small discriminative features between different tomato diseases, which can bring challenges to fine-grained visual categorization of tomato leaf-based images. Therefore, we propose a novel model, which consists of 3 networks, including a Location network, a Feedback network, and a Classification network, named LFC-Net. At the same time, a self-supervision mechanism is proposed in the model, which can effectively detect informative regions of tomato image without the need for manual annotation such as bounding boxes/parts. Based on the consideration of the consistency between category of the image and informativeness of the image, we design a novel training paradigm. The Location network of the model first detects informative regions in the tomato image, and optimizes iterations under the guidance of the Feedback network. Then, the Classification network uses informative regions proposed by the Location network and the full image of the tomato for classification. Our model can be regarded as a multi-network collaboration, and networks can progress together. Compared with the pre-trained model on ImageNet, our model achieves the most advanced performance in the tomato dataset, with accuracy up to 99.7%. This work demonstrates that our model has a high accuracy and has the potential to be applied to other vegetable and fruit datasets, which can provide a reference for the prevention and control of tomato diseases. INDEX TERMS Fine-grained visual categorization, multi-network, self-supervised, tomato diseases This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.