Proceedings of the 5th International Conference on Computer Science and Software Engineering 2022
DOI: 10.1145/3569966.3569990
|View full text |Cite
|
Sign up to set email alerts
|

CTI-GAN: Cross-Text-Image Generative Adversarial Network for Bidirectional Cross-modal Generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…These methods leverage largescale unlabeled image and text data for pretraining and learning aligned representations for images and text. Some methods incorporate techniques like generative adversarial networks (GANs) and variational autoencoders (VAEs) to generate richer and continuous cross-modal representations [36,37]. Others introduce image and text reconstruction tasks in self-supervised learning to further enhance the model's learning capability [38,39].…”
Section: Cross-modal Correlation Algorithmmentioning
confidence: 99%
“…These methods leverage largescale unlabeled image and text data for pretraining and learning aligned representations for images and text. Some methods incorporate techniques like generative adversarial networks (GANs) and variational autoencoders (VAEs) to generate richer and continuous cross-modal representations [36,37]. Others introduce image and text reconstruction tasks in self-supervised learning to further enhance the model's learning capability [38,39].…”
Section: Cross-modal Correlation Algorithmmentioning
confidence: 99%