2021
DOI: 10.3390/s21227504
|View full text |Cite
|
Sign up to set email alerts
|

GourmetNet: Food Segmentation Using Multi-Scale Waterfall Features with Spatial and Channel Attention

Abstract: We propose GourmetNet, a single-pass, end-to-end trainable network for food segmentation that achieves state-of-the-art performance. Food segmentation is an important problem as the first step for nutrition monitoring, food volume and calorie estimation. Our novel architecture incorporates both channel attention and spatial attention information in an expanded multi-scale feature representation using our advanced Waterfall Atrous Spatial Pooling module. GourmetNet refines the feature extraction process by merg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 44 publications
0
13
1
Order By: Relevance
“…In addition, a new database of food images with related recipes and nutritional information is collected in real-world hospital settings. Sharma et al [11] presented a GourmetNet for food segmentation with multi-scale feature representation. This network incorporates both spatial attention and channel attention using waterfall atrous spatial pooling module.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, a new database of food images with related recipes and nutritional information is collected in real-world hospital settings. Sharma et al [11] presented a GourmetNet for food segmentation with multi-scale feature representation. This network incorporates both spatial attention and channel attention using waterfall atrous spatial pooling module.…”
Section: Related Workmentioning
confidence: 99%
“…However, their proposed method is more complex and less time-efficient compared to DeepLabV3+. Sharma et al [37] proposed a novel architecture named Gourmet-Net which incorporates both channel and spatial attention informations in an expanded multi-scale feature representation using advanced Waterfall Atrous Spatial Pooling (WASPv2) [38] module with channel and spatial attention mechanisms. GourmetNet achieves state-of-the-art performance on the UNIMIB2016 and UECFood-Pix datasets.…”
Section: Segmentation Model For Food Imagementioning
confidence: 99%
“…Achieving on these datasets a mIoU of 71.79% and 65.13% respectively. A more recent work, [39] proposed a Bayesian version of DeepLabv3+ and GourmetNet [37] to perform multi-class segmentation of foods.…”
Section: Segmentation Model For Food Imagementioning
confidence: 99%
See 1 more Smart Citation
“…GourmetNet achieved state-of-the-art performance on the UNIMIB2016 and UECFoodPix datasets, achieving an mIoU of 71.79% and 65.13% on these datasets, respectively. A more recent work, [ 39 ], proposed a Bayesian version of DeepLabv3+ and GourmetNet [ 37 ] to perform multi-class segmentation of foods.…”
Section: Related Workmentioning
confidence: 99%