2018
DOI: 10.1109/lsp.2018.2873892
|View full text |Cite
|
Sign up to set email alerts
|

MS-CapsNet: A Novel Multi-Scale Capsule Network

Abstract: Capsule network is a novel architecture to encode the properties and spatial relationships of the feature in the images, which shows encouraging results on image classification. However, the original capsule network is not suitable for some classification tasks that the detected object has complex internal representations. Hence, we propose Multi-Scale Capsule Network, a novel variation of capsule network to enhance the computational efficiency and representation capacity of capsule network. The proposed Multi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
122
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 210 publications
(124 citation statements)
references
References 10 publications
2
122
0
Order By: Relevance
“…The CapsNet uses a group of neurons as a capsule to replace a neuron in the traditional neural network. In addition, the capsule is a vector to represent internal properties that can be used to learn part-whole relationships between various entities, such as objects or object parts, to achieve equivariance [36] and can solve the problem of traditional neural networks using fully connected layers cannot efficiently capture the hierarchical structure of the entities in images to preserve the spatial information [50].…”
Section: Introductionmentioning
confidence: 99%
“…The CapsNet uses a group of neurons as a capsule to replace a neuron in the traditional neural network. In addition, the capsule is a vector to represent internal properties that can be used to learn part-whole relationships between various entities, such as objects or object parts, to achieve equivariance [36] and can solve the problem of traditional neural networks using fully connected layers cannot efficiently capture the hierarchical structure of the entities in images to preserve the spatial information [50].…”
Section: Introductionmentioning
confidence: 99%
“…The final reconstructed structure consists of 512, 1024, and 784 channels in three fully connected layers. We also selected two derivatives of the standard capsule network: MS-CapsNet 12 and TextCaps. 13 Both of these derivatives were applied to the Fashion-MNIST and CIFAR-10 datasets, and the main improvements from both variants also lie in their feature extraction processes.…”
Section: Resultsmentioning
confidence: 99%
“…For low-level capsules, location information [20] is "position-coded" by the active capsule. As the hierarchy is raised, more and more positional information is "rate-coded" [21], [22] in the actual value component of the capsule output vector.…”
Section: Capsule Networkmentioning
confidence: 99%