2022
DOI: 10.48550/arxiv.2205.05949
Preprint

Automated Audio Captioning: an Overview of Recent Progress and New Challenges

Xinhao Mei,
Xubo Liu,
Mark D. Plumbley
et al.

Abstract: Automated audio captioning is a cross-modal translation task that aims to generate natural language descriptions for given audio clips. This task has received increasing attention with the release of freely available datasets in recent years. The problem has been addressed predominantly with deep learning techniques. Numerous approaches have been proposed, such as investigating different neural network architectures, exploiting auxiliary information such as keywords or sentence information to guide caption gen…

Cited by 0 publications
References 53 publications
(95 reference statements)