Learning Semantic Graphics Using Convolutional Encoder–Decoder Network for Autonomous Weeding in Paddy

Adhikari, Shyam Prasad; Yang, Heechan; Kim, Hyongsuk

doi:10.3389/fpls.2019.01404

Cited by 52 publications

(33 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…While [21], [22] used CNN-based semantic segmentation to discriminate crops, weeds and background, the actual lines of crop are not extracted. In our previous work we presented that CNN can directly be trained to learn the concept of a crop line using ''semantic graphics'' [23], as shown in Fig. 1.…”

Section: Related Researchmentioning

confidence: 99%

Deep Neural Network-Based System for Autonomous Navigation in Paddy Field

Adhikari

Kim

2020

IEEE Access

Self Cite

View full text Add to dashboard Cite

This paper presents a novel vision based approach for detecting rows of crop in paddy field. The precise detection of crop row enables a farm-tractor to autonomously navigate the field for successful inter-row weeding. While prior works on crop row detection rely primarily on various image based features, a deep neural network based approach for learning semantic graphics to directly extract the crop rows from an input image is used in this work. A deep convolutional encoder decoder network is trained to detect the crop lines using semantic graphics. The detected crop lines are then used to derive control signal for steering the tractor autonomously in the field. The results demonstrate that the proposed method is able to detect the rows of paddy accurately and enable the tractor to navigate autonomously along the crop rows even with a simple proportional only controller.INDEX TERMS Convolutional encoder-decoder network, crop line detection, semantic graphics, vision based control.

show abstract

Section: Related Researchmentioning

confidence: 99%

Deep Neural Network-Based System for Autonomous Navigation in Paddy Field

Adhikari

Kim

2020

IEEE Access

Self Cite

View full text Add to dashboard Cite

show abstract

“…Simultaneously, the MSFFU-Net contained extended two skip connections: one was that each set of feature maps generated on the encoder path are concatenated to the corresponding feature maps on the decoder path; the other was that transferring of max pooling indices values from the encoder to the decoder to locate contour position information of multi-scale retinal vessel features for higher segmentation accuracy [ 32 ]. The feature maps of the upsampling operation were merged with the corresponding output feature maps of the two extended skip modules [ 33 ], as shown in Figure 6 .…”

Section: Proposed Methodsmentioning

confidence: 99%

A Multi-Scale Feature Fusion Method Based on U-Net for Retinal Vessel Segmentation

Yang

Liu

Ren

et al. 2020

Entropy

View full text Add to dashboard Cite

Computer-aided automatic segmentation of retinal blood vessels plays an important role in the diagnosis of diseases such as diabetes, glaucoma, and macular degeneration. In this paper, we propose a multi-scale feature fusion retinal vessel segmentation model based on U-Net, named MSFFU-Net. The model introduces the inception structure into the multi-scale feature extraction encoder part, and the max-pooling index is applied during the upsampling process in the feature fusion decoder of an improved network. The skip layer connection is used to transfer each set of feature maps generated on the encoder path to the corresponding feature maps on the decoder path. Moreover, a cost-sensitive loss function based on the Dice coefficient and cross-entropy is designed. Four transformations—rotating, mirroring, shifting and cropping—are used as data augmentation strategies, and the CLAHE algorithm is applied to image preprocessing. The proposed framework is tested and trained on DRIVE and STARE, and sensitivity (Sen), specificity (Spe), accuracy (Acc), and area under curve (AUC) are adopted as the evaluation metrics. Detailed comparisons with U-Net model, at last, it verifies the effectiveness and robustness of the proposed model. The Sen of 0.7762 and 0.7721, Spe of 0.9835 and 0.9885, Acc of 0.9694 and 0.9537 and AUC value of 0.9790 and 0.9680 were achieved on DRIVE and STARE databases, respectively. Results are also compared to other state-of-the-art methods, demonstrating that the performance of the proposed method is superior to that of other methods and showing its competitive results.

show abstract

“…In some regions, labor cost makes up more than half of the total production cost, e.g., 60% in Norway ( Xiong et al, 2019 ). Furthermore, there is a decline in interest of joining the agriculture industry among the new generation of workers ( Adhikari et al, 2019 ). Under all these challenges the food industry must keep up with the demands of the ever-growing population.…”

Section: Introductionmentioning

confidence: 99%

“…Due to outstanding performances of DCNNs in computer vision tasks, robotics and unmanned systems are now faster and more reliable than ever. Which in turn has allowed their adoption into many real-life applications like the detection of crop rows, weeds, and seeding beds in fields of maize and rice ( Guerrero et al, 2017 ; Adhikari et al, 2019 ; Ma et al, 2019 ).…”

Section: Introductionmentioning

confidence: 99%

DAM: Hierarchical Adaptive Feature Selection Using Convolution Encoder Decoder Network for Strawberry Segmentation

et al. 2021

Self Cite

View full text Add to dashboard Cite

Autonomous harvesters can be used for the timely cultivation of high-value crops such as strawberries, where the robots have the capability to identify ripe and unripe crops. However, the real-time segmentation of strawberries in an unbridled farming environment is a challenging task due to fruit occlusion by multiple trusses, stems, and leaves. In this work, we propose a possible solution by constructing a dynamic feature selection mechanism for convolutional neural networks (CNN). The proposed building block namely a dense attention module (DAM) controls the flow of information between the convolutional encoder and decoder. DAM enables hierarchical adaptive feature fusion by exploiting both inter-channel and intra-channel relationships and can be easily integrated into any existing CNN to obtain category-specific feature maps. We validate our attention module through extensive ablation experiments. In addition, a dataset is collected from different strawberry farms and divided into four classes corresponding to different maturity levels of fruits and one is devoted to background. Quantitative analysis of the proposed method showed a 4.1% and 2.32% increase in mean intersection over union, over existing state-of-the-art semantic segmentation models and other attention modules respectively, while simultaneously retaining a processing speed of 53 frames per second.

show abstract

Learning Semantic Graphics Using Convolutional Encoder–Decoder Network for Autonomous Weeding in Paddy

Cited by 52 publications

References 44 publications

Deep Neural Network-Based System for Autonomous Navigation in Paddy Field

Deep Neural Network-Based System for Autonomous Navigation in Paddy Field

A Multi-Scale Feature Fusion Method Based on U-Net for Retinal Vessel Segmentation

DAM: Hierarchical Adaptive Feature Selection Using Convolution Encoder Decoder Network for Strawberry Segmentation

Contact Info

Product

Resources

About