2020
DOI: 10.1016/j.compag.2020.105302
|View full text |Cite
|
Sign up to set email alerts
|

Fruit detection, segmentation and 3D visualisation of environments in apple orchards

Abstract: Robotic harvesting of fruits in orchards is a challenging task, since high density and overlapping of fruits and branches can heavily impact the success rate of robotic harvesting. Therefore, the vision system is demanded to provide comprehensive information of the working environment to guide the manipulator and gripping system to successful detach the target fruits. In this study, a deep learning based one-stage detector DaSNet-V2 is developed to perform the multi-task vision sensing in the working environme… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
75
0
4

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 164 publications
(79 citation statements)
references
References 35 publications
0
75
0
4
Order By: Relevance
“…An improved light-weight one-stage instance segmentation network ’Mobile-DasNet’ is developed in this research work, to perform fruit recognition, as shown in Figure 3 . Compared to the previous network, DasNet [ 28 ], which applies resnet-50 [ 47 ] as the backbone and a three levels Feature Pyramid Network (FPN), the proposed Mobile-DasNet applies a light-weigth backbone ’MobileNet’ [ 48 ] and a two-levels FPN (receive feature maps from C4, and C5 levels) to improve its computational efficiency. The proposed Mobile-DasNet achieves a weight size of 20.5 MB and the average running speed of 63 FPS on an NVIDIA GTX-1070 GPU.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…An improved light-weight one-stage instance segmentation network ’Mobile-DasNet’ is developed in this research work, to perform fruit recognition, as shown in Figure 3 . Compared to the previous network, DasNet [ 28 ], which applies resnet-50 [ 47 ] as the backbone and a three levels Feature Pyramid Network (FPN), the proposed Mobile-DasNet applies a light-weigth backbone ’MobileNet’ [ 48 ] and a two-levels FPN (receive feature maps from C4, and C5 levels) to improve its computational efficiency. The proposed Mobile-DasNet achieves a weight size of 20.5 MB and the average running speed of 63 FPS on an NVIDIA GTX-1070 GPU.…”
Section: Methodsmentioning
confidence: 99%
“…Kang et al [ 27 ] introduced a novel multi-function neural network DasNet-v1 based on YOLO for real-time detection and semantic segmentation for both apples and branches in orchard environments. The detection and segmentation network with ResNet-101 backbone outperformed the corresponding task, while the network model with lightweight backbone also showed the best computation efficiency in the results.In the ensuing work [ 28 ], an enhanced deep neural network DasNet-v2 was developed, which achieved detection and instance segmentation on fruit and semantic segmentation on branches. The DasNet-v2 outperformed the previous neural network on the precision of apple detection and accuracy of semantic segmentation of branches and also applied instance segmentation on fruit as a new feature.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The works of [17] and [18] applied FCN model to detect the guava fruits and cotton, respectively. In our previous works [19,20,21], a YOLO and SPRNet [22] architecture based multi-purpose deep CNN network model DASNet was developed to perform real-time detection, instance segmentation of fruits, and semantic segmentation of branch/trunk in orchard environments. More similar works of deep-learning based algorithms in machine vision in agriculture applications can also be found in the recent survey [23].…”
Section: Related Work A: Visual Perceptionmentioning
confidence: 99%
“…The workflow of the visual perception and modelling algorithm is shown in Figure 1. Firstly, DASNet [20,21] performs segmentation and detection on input RGB images to extract objects of interest from the working scenarios. Then, by combining the depth map which is collected by RGB-D camera, the processed information is used to modelling the working scenarios of the orchards.…”
Section: System Architecturementioning
confidence: 99%
See 1 more Smart Citation