2023
DOI: 10.1109/tmm.2022.3177894
|View full text |Cite
|
Sign up to set email alerts
|

FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 45 publications
0
6
0
Order By: Relevance
“…Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN): CNN and RNN, as fundamental deep learning models, are widely used in Foley production. For instance, Andrew Owens in [17] employed a trained CNN to predict the sounds that occurred during video recording. Understanding these basic neural network models provides a profound insight into the related algorithmic models.…”
Section: Basic Methods and Principles Mainly Involved By Foleymentioning
confidence: 99%
See 4 more Smart Citations
“…Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN): CNN and RNN, as fundamental deep learning models, are widely used in Foley production. For instance, Andrew Owens in [17] employed a trained CNN to predict the sounds that occurred during video recording. Understanding these basic neural network models provides a profound insight into the related algorithmic models.…”
Section: Basic Methods and Principles Mainly Involved By Foleymentioning
confidence: 99%
“…This evaluation revealed that the author's binary-based model approach had the most significant advantages. Additionally, [17] and [13] both used the Inception Score (IS) for assessment. The core idea of IS involves utilizing a pretrained image classification model, such as the Inception network, to classify the generated images.…”
Section: Dataset Evaluation Methodsmentioning
confidence: 99%
See 3 more Smart Citations