2022
DOI: 10.21203/rs.3.rs-2046359/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Developing a deep neural network-based encoder-decoder framework in automatic image captioning systems

Abstract: This study is concerned with the development of a deep neural network-based framework, including a “convolutional neural network (CNN)” encoder and a “Long Short-Term Memory (LSTM)” decoder in an automatic image captioning application. The proposed model percepts information points in a picture and their relationship to one another in the viewpoint. Firstly, a CNN encoder excels at retaining spatial information and recognizing objects in images by extracting features to produce vocabulary that describes the ph… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 23 publications
0
0
0
Order By: Relevance