An Effective Automatic Image Annotation Model Via Attention Model and Data Equilibrium

Vatani, Amir; Ahvanooey, Milad Taleby; Rahimi, Mostafa

doi:10.14569/ijacsa.2018.090338

Cited by 8 publications

(5 citation statements)

References 42 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Furthermore, the F1 score obtained by our method is at least 10% higher than that obtained by other recent studies such as GCN (2020) [73], SSL-AWF (2021) [81], and MVRSC (2021) [82]. Now, if we look at the scenario of 374 concepts, we can see that our proposed method has surpassed all other methods except for that of Vatani et al (2020) [85]. However, if we consider the method of Vatani et al in terms of N+, we can see that our method outperforms it by eight concepts.…”

Section: Scenario 2: Comparing Our Methods To the State Of The Artcontrasting

confidence: 42%

Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation

et al. 2021

View full text Add to dashboard Cite

Automatic image annotation is an active field of research in which a set of annotations are automatically assigned to images based on their content. In literature, some works opted for handcrafted features and manual approaches of linking concepts to images, whereas some others involved convolutional neural networks (CNNs) as black boxes to solve the problem without external interference. In this work, we introduce a hybrid approach that combines the advantages of both CNN and the conventional concept-to-image assignment approaches. J-image segmentation (JSEG) is firstly used to segment the image into a set of homogeneous regions, then a CNN is employed to produce a rich feature descriptor per area, and then, vector of locally aggregated descriptors (VLAD) is applied to the extracted features to generate compact and unified descriptors. Thereafter, the not too deep clustering (N2D clustering) algorithm is performed to define local manifolds constituting the feature space, and finally, the semantic relatedness is calculated for both image–concept and concept–concept using KNN regression to better grasp the meaning of concepts and how they relate. Through a comprehensive experimental evaluation, our method has indicated a superiority over a wide range of recent related works by yielding F1 scores of 58.89% and 80.24% with the datasets Corel 5k and MSRC v2, respectively. Additionally, it demonstrated a relatively high capacity of learning more concepts with higher accuracy, which results in N+ of 212 and 22 with the datasets Corel 5k and MSRC v2, respectively.

show abstract

Section: Scenario 2: Comparing Our Methods To the State Of The Artcontrasting

confidence: 42%

Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation

et al. 2021

View full text Add to dashboard Cite

show abstract

“…Ivasic-Kos et al [12] proposed a framework based on semantic and discriminative classification. Vatani et al [13] come up with a deep learning model with feature extraction method using three phases: a extraction of feature, a generation of tag, and an annotating an image. This model attempts to unravel the problem of imbalanced data in image annotation.…”

Section: Nearest Neighbor-based Modelmentioning

confidence: 99%

Study of Various Types of Data Annotation

Ningthoujam

Singh

2021

Advances in Intelligent Systems and Computing

View full text Add to dashboard Cite

Labeling of digital data has made it easier for an algorithm to understand and process the dataset using machine learning techniques. There are various methods that are used to add the necessary information to gather data and achieve a perfect ground truth. The objective of this paper is to discuss the types of digital data annotation viz image, audio, and video. After discussing the various types, the paper focuses on different models used for annotating and how it has been evaluated on various dataset.

show abstract

“…Few famous novels are taken as sample and trained LSTM to generate a long sequence of sentence which is similar writing of the novel. Amir Vatani et al [11] proposed image annotation with low-level and high-level feature extraction, tag generation and annotation which optimizes the common issue in image annotation called as semantic gap. Jacobian matrix creation in visual control systems is a challenging one which leads to give opportunity for estimation error, observation error and filter error.…”

Section: Related Workmentioning

confidence: 99%

“…Later deep learning model has been introduced and it is proven in various research papers [42,43,44] as better model because the network learns itself. Recently, deep learning methods are rocking in this research area and itslowly transforms the annotation into captioning.In deep learning, annotation models can be classified as annotating with tags [10,11,14,20,29,29,34], finding sequence of words [5,9,13,15,22,24,25,27],captioning [17,18,19,26,21]and classification [6,7,14,16].In the first model, assigning tags to images based on extracted features or objects in the image. Related tags are identified and make it as a sequence of tags whereas the third model combined related tags with Natural Language processing as meaningful sentence called as captioning.In deep learning annotation models, Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) perform well to encode features of image and decode the features into the natural language representation [2].Later Long-Short-Term-Memory (LSTM) has been introduced to conserve the dependency for future reference and good in natural language generating [13] which RNN can't.…”

Section: Introductionmentioning

confidence: 99%

CSL Net: Convoluted SE and LSTM Blocks Based Network for Automatic Image Annotation

2019

IJEAT

View full text Add to dashboard Cite

Due to advancement of multimedia technology, availability and usage of image and video data is enormous. For indexing and retrieving those data, there is a need for an efficient technique. Now, Automatic keyword generation for images is a focussed research which has lot of attractions. In general, conventional auto annotation methods having lesser performance over deep learning methods. The annotation is transformed as captioning in deep learning models. In this paper, we propose a new model CSL Net (CSLN) as a combination of convoluted squeeze and excitation block with Bi-LSTM blocks to predict tags for images. The proposed model is evaluated using the various benchmark datasets like CIFAR10, Corel5K, ESPGame and IAPRTC12. It is observed that, the proposed work yields better results compared to that of the existing methods in term of precision, recall and accuracy.

show abstract

An Effective Automatic Image Annotation Model Via Attention Model and Data Equilibrium

Cited by 8 publications

References 42 publications

Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation

Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation

Study of Various Types of Data Annotation

CSL Net: Convoluted SE and LSTM Blocks Based Network for Automatic Image Annotation

Contact Info

Product

Resources

About