2022
DOI: 10.11591/ijece.v12i3.pp3092-3103

Video captioning in Vietnamese using deep learning

Abstract: With the development of today's society, demand for applications using digital cameras grows year by year. However, analyzing large amounts of video data is one of the most challenging issues. In addition to storing the data captured by the camera, intelligent systems are required to quickly analyze the data to correct important situations. In this paper, we use deep learning techniques to build automatic models that describe movements on video. To solve the problem, we use three …

Citations: cited by 3 publications (2 citation statements)
References: 23 publications
“…), the result is shown in Table 4. We use Nguyen's dataset [38], [39] including 18,108 questions and paragraphs for the improved BM25 algorithm. The results of comparisons between the two models are shown in Table 5.…”
Section: Results and Analysis (mentioning)
confidence: 99%
“…The second module recognized the activities performed in each window using a CNN model. The calculations performed on the CNN parameters required 77.06 MB of memory [23]-[25].…”
Section: Introduction (mentioning)
confidence: 99%