2020
DOI: 10.1049/iet-cvi.2019.0500
|View full text |Cite
|
Sign up to set email alerts
|

Architecture to improve the accuracy of automatic image annotation systems

Abstract: Automatic image annotation (AIA) is an image retrieval mechanism to extract relative semantic tags from visual content. So far, the improvement of accuracy in newly developed such methods have been about 1 or 2% in the F1‐score and the architectures seem to have room for improvement. Therefore, the authors designed a more detailed architecture for AIA and suggested new algorithms for its main parts. The proposed architecture has three main parts: feature extraction, learning, and annotation. They designed a no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 50 publications
0
4
0
Order By: Relevance
“…Vatani From Table 2, it appears that our proposed segmentation-based AIA method outperforms the majority of the stated related works in both scenarios of 274 and 260 concepts If we take as an instance the top two F1 scores yielded by the related works Khatchatoorian et al (2020) [72] and CNN-THOP (2020) [74] in the scenario of 260 concepts, we can clearly see that the outcomes of our method exceed those of both methods by 5% at least. Furthermore, the F1 score obtained by our method is at least 10% higher than that obtained by other recent studies such as GCN (2020) [73], SSL-AWF (2021) [81], and MVRSC (2021) [82].…”
Section: Regiomentioning
confidence: 77%
See 1 more Smart Citation
“…Vatani From Table 2, it appears that our proposed segmentation-based AIA method outperforms the majority of the stated related works in both scenarios of 274 and 260 concepts If we take as an instance the top two F1 scores yielded by the related works Khatchatoorian et al (2020) [72] and CNN-THOP (2020) [74] in the scenario of 260 concepts, we can clearly see that the outcomes of our method exceed those of both methods by 5% at least. Furthermore, the F1 score obtained by our method is at least 10% higher than that obtained by other recent studies such as GCN (2020) [73], SSL-AWF (2021) [81], and MVRSC (2021) [82].…”
Section: Regiomentioning
confidence: 77%
“…By understanding the logic that connects different concepts, the system became able to learn concepts regardless of their narrow use. On the other hand, the idea in the work of Khatchatoorian et al (2020) [72] revolves around employing CNN as a black box and letting it learn everything by itself. However, we have taken advantage of both the methods by applying a CNN to obtain a rich set of features representing the concepts and employing KNN regression to understand how these concepts are related.…”
Section: Regiomentioning
confidence: 99%
“…Upon examining the values of N + , it is evident that both Query2Label and the proposed network have predicted more than 10 labels not previously annotated by the GAP-based method, demonstrating their proficiency in annotating tags that occur less frequently. Except for certain approaches like [52] that have attempted to annotate low-frequency tags by selecting a distinct threshold for each one, extracting the image features associated with these tags using traditional CNNs is challenging. However, it is important to consider that determining the optimal threshold each label is impacted by the training data, which can constrain the model's generalizability.…”
Section: Comparisons Of Different Attention Modulesmentioning
confidence: 99%
“…The interconnection mechanisms are so basic (method invocation); thus, complex interconnections are so hard in these methods. In a nutshell, there is no clear image of system architecture before component creation [ 42 ]. Component-based SA relieves this modeling trouble and develops reusable off-the-shelf component-based heterogeneous systems.…”
Section: Software Architecture Designmentioning
confidence: 99%