2023
DOI: 10.1117/1.jmi.10.1.014005
|View full text |Cite
|
Sign up to set email alerts
|

CaraNet: context axial reverse attention network for segmentation of small medical objects

Abstract: Purpose: Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks have been designed for segmentation tasks and have achieved great success. Few studies, however, have fully considered the sizes of objects; thus, most demonstrate poor performance for small object segmentation. This can have a significant impact on the ea… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 48 publications
(17 citation statements)
references
References 45 publications
(66 reference statements)
0
17
0
Order By: Relevance
“…The cost volume method can still be improved by learning depth plane distribution rather than using linear distribution directly such as using sinusoidal activation function [33], which will be considered in our future work. Additionally, addressing the challenge from moving objects, such as surgical tools within the scene, numerous segmentation methods [34][35][36][37][38] have been designed to segment and recognize medical objects. Incorporating these segmentation techniques into WS-SfMLearner to alleviate dynamic uncertainties and refine depth prediction precision for the surgical background would an interesting direction.…”
Section: Discussionmentioning
confidence: 99%
“…The cost volume method can still be improved by learning depth plane distribution rather than using linear distribution directly such as using sinusoidal activation function [33], which will be considered in our future work. Additionally, addressing the challenge from moving objects, such as surgical tools within the scene, numerous segmentation methods [34][35][36][37][38] have been designed to segment and recognize medical objects. Incorporating these segmentation techniques into WS-SfMLearner to alleviate dynamic uncertainties and refine depth prediction precision for the surgical background would an interesting direction.…”
Section: Discussionmentioning
confidence: 99%
“…Lou et al proposed CaraNet 30 . CaraNet 30 fuses shallow and deep features through a context axial reverse attention module to enhance the learning of global and local semantics. However, it only combines dual‐scale features without utilizing multiscale information.…”
Section: Related Workmentioning
confidence: 99%
“…To address the difficulties in the segmentation process, researchers have proposed various network architectures for optimization. Lou et al proposed CaraNet 30 . CaraNet 30 fuses shallow and deep features through a context axial reverse attention module to enhance the learning of global and local semantics.…”
Section: Related Workmentioning
confidence: 99%
“…To address these uncertainties, segmentation networks [8][9] can be used to filter the background and foreground in pixel-level detail. We apply the Segment Anything Model (SAM) [10] to segment the surgical tool using a rough bounding box as input prompt, which provides a more accurate segmentation mask compared to some other CNNbased methods [11]. Our goal in this paper is to build a model to reconstruct the surgical scene and track the movement of surgical tools.…”
Section: Introductionmentioning
confidence: 99%