Lithology identification of rocks is an important part in the field of oil and gas exploration, mineral exploration, and geological analysis. How to accomplish rock classification is a key issue for the further development of the geology industry. The current main method for classifying rock pictures containing background is to select sample points or disregard the disturbance of the background. For more accurate classification, the rock part extraction method for rock images containing boundaries is designed to eliminate the influence of background. First, the rock parts are extracted based on the image gradient information and color information, respectively. Then, the two images are intersected to realize the refinement of pixel-level information to obtain a pure rock image. Ensemble ResNet18 (ERN18) is designed as an image classification model. It contains basic blocks to reduce the loss of features during the training process. The method breaks the neglect of most previous studies on background interference. The effect of misclassification in certain regions on the results is eliminated by ensemble learning based on the voting method. The classification results are further improved. Compared with the effects of LeNet, AlexNet, and ResNet, ERN18 has achieved significant results.
Object tracking is a crucial research area within the field of intelligent transportation, providing a vital foundation for anomalous behavior analysis and traffic statistics. Although pedestrian detectors have shown impressive results, leading to the advancement of detection-based tracking methods, target association in complex scenarios remains a difficult and less efficient task due to the lack of feature robustness in the presence of partial occlusions. In the proposed tracking method, we extract convolutional features on each entire object and its local blocks, segmented by the superpixel algorithm. Aiming to emphasize the global and local information respectively, the global features for each entire object are extracted from the last layer of the backbone network, while local features are derived from a specific intermediate layer of the backbone network. The association between tracked targets and detected pedestrian candidates relies on fused similarity degrees. Furthermore, we use the transformer's self-attention mechanism to predict features for the current frame based on the information within past frames, aiming to eliminate the effects of target appearance variations. Additionally, we remove redundant background pixels in the detected rectangles of pedestrian candidates by using a background modeling algorithm. Experimental results demonstrate that the tracker proposed in this paper outperforms other trackers across five publicly available datasets, indicating its effectiveness and potential for further development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.