Chassis Assembly Detection and Identification Based on Deep Learning Component Instance Segmentation

Liu, Guixiong; He, Binyuan; Liu, Siyuang; Huang, Jian

doi:10.3390/sym11081001

Cited by 6 publications

(4 citation statements)

References 44 publications

(53 reference statements)

“…Moreover, the speed problem is mainly based on practical applications. The atrous convolution architecture eliminates part of the CNN pooling layer while replacing the convolutional layer with a cascade or parallel atrous convolutional layer, enabling the analysis of the feature map at multiple arbitrary scales, and thus significantly improving the segmentation accuracy [11][12][13] and providing the possibility of detecting applications in the field of low power consumption. For obtaining a more accurate and faster fewer-parameters model as well as a method to achieve online machine vision and identification, in this study, the weight optimization technology…”

Section: Introduction

mentioning

confidence: 99%
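The quoted statement describes atrous (dilated) convolution as a way to analyze a feature map at several scales without pooling. The following is a minimal one-dimensional sketch of that idea, not the cited network: the function names `atrous_conv1d` and `receptive_field`, the toy signal, and the kernel are all assumptions chosen for illustration. Running the same kernel at several rates in parallel loosely mirrors the parallel branches the statement mentions.

```python
def receptive_field(kernel_size, rate):
    """Number of input samples spanned by one output of a dilated kernel."""
    return (kernel_size - 1) * rate + 1


def atrous_conv1d(signal, kernel, rate=1):
    """Valid-mode 1-D atrous convolution (cross-correlation form).

    Kernel taps are applied `rate` samples apart, so the receptive field
    grows with `rate` while the number of weights stays the same.
    """
    span = receptive_field(len(kernel), rate)
    outputs = []
    for start in range(len(signal) - span + 1):
        outputs.append(
            sum(w * signal[start + i * rate] for i, w in enumerate(kernel))
        )
    return outputs


# Hypothetical toy data: a ramp signal and a simple difference kernel.
signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]

# Parallel branches at different rates: same 3 weights, wider context.
branches = {rate: atrous_conv1d(signal, kernel, rate) for rate in (1, 2, 3)}
for rate, values in branches.items():
    print(rate, receptive_field(len(kernel), rate), values)
```

The parameter count is fixed by the kernel length, while the context each output sees is set by the rate, which is why dilation is attractive when a smaller, faster model is the goal.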

Fast semantic segmentation method for machine vision inspection based on a fewer-parameters atrous convolution neural network

Huang

Liu

2021

PLoS ONE

Self Cite

Owing to the recent development in deep learning, machine vision has been widely used in intelligent manufacturing equipment in multiple fields, including precision-manufacturing production lines and online product-quality inspection. This study addresses online machine vision inspection, focusing on the method of online semantic segmentation under complex backgrounds. First, the fewer-parameters optimization of the atrous convolution architecture is studied. Atrous spatial pyramid pooling (ASPP) and residual network (ResNet) are selected as the basic architectures of ηseg and ηmain, respectively, which indicate that the improved proportion of the participating input image feature is beneficial for improving the accuracy of feature extraction during the change of the number and dimension of feature maps. Second, this study proposes five modified ResNet residual building blocks, with the main path having a 3 × 3 convolution layer, 2 × 2 skip path, and pooling layer with ls = 2, which can improve the use of image features. Finally, the simulation experiments show that the modified structure can significantly decrease the segmentation time Tseg from 719 to 296 ms (a decrease of 58.8%), with only a slight decrease in the intersection-over-union from 86.7% to 86.6%. The applicability of the proposed machine vision method was verified through the segmentation recognition of the 2019 version of the Chinese yuan (CNY). Compared with the conventional method, the proposed model of semantic segmentation visual detection effectively reduces the detection time while ensuring the detection accuracy and has a significant effect of fewer-parameters optimization. This allows for the possibility of neural network detection on mobile terminals.
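The two figures of merit in this abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only: `relative_reduction` and `iou` are hypothetical helper names, and the binary masks are made-up toy data rather than anything from the cited experiments. Only the 719 ms and 296 ms timings are taken from the abstract.

```python
def relative_reduction(before, after):
    """Percentage decrease from `before` to `after`."""
    return (before - after) / before * 100.0


def iou(pred, target):
    """Intersection-over-union of two binary masks.

    Masks are flat, equal-length sequences of 0/1 values.
    Two empty masks are treated as a perfect match.
    """
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    return intersection / union if union else 1.0


# Segmentation time reported in the abstract: 719 ms -> 296 ms.
print(round(relative_reduction(719, 296), 1))  # 58.8

# Toy masks (assumed data): one pixel overlaps, three are covered in total.
print(iou([1, 1, 0, 0], [1, 0, 1, 0]))
```

The first result matches the 58.8% decrease stated in the abstract; the IoU call only shows how the metric is formed from pixel counts.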

“…On the other hand, semantic segmentation can recognize the type of the object and divide the actual area at the pixel level, as well as implement certain machine vision detection functions, such as positioning and recognition [3]. As we start from image classification, move to object detection, and finally reach semantic segmentation, the accuracy of the output range and position information improves [4]. In the same manner, the recognition precision increases from the image-level to the pixel-level.…”

Section: Introduction

mentioning

confidence: 99%

Semantic Segmentation under a Complex Background for Machine Vision Detection Based on Modified UPerNet with Component Analysis Modules

Huang

Liu

Wang

2020

Mathematical Problems in Engineering

Self Cite

Semantic segmentation with convolutional neural networks under a complex background using the encoder-decoder network increases the overall performance of online machine vision detection and identification. To maximize the accuracy of semantic segmentation under a complex background, it is necessary to consider the semantic response values of objects and components and their mutually exclusive relationship. In this study, we attempt to improve the low accuracy of component segmentation. The basic network of the encoder is selected for the semantic segmentation, and the UPerNet is modified based on the component analysis module. The experimental results show that the accuracy of the proposed method improves from 48.89% to 55.62% and the segmentation time decreases from 721 to 496 ms. The method also shows good performance in vision-based detection of 2019 Chinese Yuan features.

“…Image classification is an image-level visual recognition task that aims to classify each visual image into one of the pre-defined semantic categories; object detection is an instance-level visual recognition task that locates all the objects in a visual image and recognizes their semantic categories; semantic segmentation is a pixel-level visual recognition task that aims to assign a semantic category label to each and every pixel of an image. The progress in this research field enable a wide range of applications in computer vision, including autonomous vehicles [25][26][27][28][29][30], the analysis of medical images [31][32][33][34][35], the surveillance of manufacturing [37][38][39][40][41][42], construction [43][44][45][46], agriculture [47][48][49][50][51][52] and retail [53][54][55][56], and augmented and virtual reality in entertainment [57][58][59][60]. The technical methods of visual recognition can be broadly…”

Section: Visual Recognition

mentioning

confidence: 99%
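The distinction drawn in this statement between image-level, instance-level, and pixel-level recognition comes down to the granularity of the output. The sketch below illustrates that on a hypothetical 3 × 3 label map; the helper names and the labels are assumptions. As a simplification it produces one bounding box per class rather than per object instance, so it is not a real detector.

```python
# Pixel-level output: one semantic label per pixel (toy data, "bg" = background).
mask = [
    ["bg",  "car", "car"],
    ["bg",  "car", "car"],
    ["dog", "bg",  "bg"],
]


def image_level_labels(label_map):
    """Image-level view: which categories appear anywhere in the image."""
    return sorted({lab for row in label_map for lab in row if lab != "bg"})


def bounding_boxes(label_map):
    """Box-level view: (x_min, y_min, x_max, y_max) per category.

    Simplification: one box per class, not one per object instance.
    """
    boxes = {}
    for y, row in enumerate(label_map):
        for x, lab in enumerate(row):
            if lab == "bg":
                continue
            x0, y0, x1, y1 = boxes.get(lab, (x, y, x, y))
            boxes[lab] = (min(x0, x), min(y0, y), max(x1, x), max(y1, y))
    return boxes


print(image_level_labels(mask))  # classification-style output
print(bounding_boxes(mask))      # detection-style output
print(mask)                      # segmentation-style output
```

Each coarser output can be derived from the pixel-level map, but not the other way round, which is the sense in which positional information becomes more precise from classification to segmentation.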

“…object detection [21,22] and semantic segmentation [23,24]. In practice, visual recognition (i.e., classification, detection and segmentation) plays a significant role in various computer vision scenarios and applications including transportation (e.g., autonomous vehicles [25,26], drones [27,28] and robots [29,30]), healthcare (e.g., analysis of CT [31] and MRI [32,33] images, cancer detection [34,35] and patient movement analysis [36]), manufacturing (e.g., defect inspection [37,38], scene text recognition [39,40] and product assembly [41,42]), construction (e.g., predictive maintenance [43,44] and personal protective equipment detection [45,46]), agriculture (e.g., crop and livestock surveillance [47,48], automatic weeding [49,50] and insect detection [51,52]), retail (e.g., self-checkout [53,54] and surveillance for unmanned supermarkets [55,56]) and entertainment (e.g., augmented reality [57,58] and virtual reality [59,60]).…”

mentioning

confidence: 99%

Transductive transfer learning for visual recognition

Huang

…has given invaluable advice on almost all aspects of my Ph.D. study. I would also like to express great gratitude to all the lab-mates in the MICL, CSL and SCALE Labs and all the members of Prof Lu's research group for the enthusiastic discussions and priceless support over the last four years. Dayan, Aoran and Pengdeng brought me many precious memories in my Ph.D. journey. I thank Jingyi, Kaiwen, Zichen, Yun and Xueying for their hard work while we collaborated on projects and research. I enjoyed the good work we have done and the time spent working as a team. Thanks to all of my thesis committee members, Prof. Lin Guosheng, Prof. Liu Ziwei and Prof. Chen Change Loy, for their time, kind suggestions and feedback on my qualification examination and thesis. You have all been very patient and kind throughout the whole process, and your encouragement and suggestions have helped refine my work greatly. Last but most importantly, I wish to express the deepest gratitude to my family for their constant and unconditional love and support. Thank you for always being there for me, for listening, for enlightening and for giving me the strength and courage to overcome difficulties and pursue greatness.
