2022
DOI: 10.1007/978-3-031-20059-5_33
|View full text |Cite
|
Sign up to set email alerts
|

Speaker-Adaptive Lip Reading with User-Dependent Padding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 54 publications
0
4
0
Order By: Relevance
“…The key idea of prompting is to modify the inputs rather than network parameters (Li and Liang 2021). Following (Jia et al 2022;Kim, Kim, and Ro 2022), we propose to transform the padding, an input to convolutional layers, to be our prompt. In convolutional layers, padding is usually employed to maintain or control the size of the output feature maps, such as zero padding and reflect padding.…”
Section: Prompting For Test-time Personalizationmentioning
confidence: 99%
“…The key idea of prompting is to modify the inputs rather than network parameters (Li and Liang 2021). Following (Jia et al 2022;Kim, Kim, and Ro 2022), we propose to transform the padding, an input to convolutional layers, to be our prompt. In convolutional layers, padding is usually employed to maintain or control the size of the output feature maps, such as zero padding and reflect padding.…”
Section: Prompting For Test-time Personalizationmentioning
confidence: 99%
“…Model Reprogramming, also known as Adversarial Reprogramming, is a form of adversarial attack on ML models that transforms the inputs for model to change the model's behavior to a desired adversarial behavior [Elsayed et al, 2018]. Transforming the inputs for images, also known as visual prompting, is usually achieved by adding values to whole or parts of the image or padding the image with custom values [Elsayed et al, 2018;Zheng et al, 2021;Jia et al, 2022;Kim et al, 2022;Bahng et al, 2022;Zhang et al, 2022].…”
Section: Model Reprogrammingmentioning
confidence: 99%
“…Prompt-based learning, widely recognized in natural language processing (NLP), employs specific text inputs to amplify the proficiency of NLP models (Lester, Al-Rfou, and Constant 2021), (Li and Liang 2021). This concept has been extended to computer vision as visual prompts, showcasing promising applications (Kim, Kim, and Ro 2022), (Lin et al 2023), (Chen et al 2023). While methods like fine-tuning pre-trained vision models with visual prompts have demonstrated competitive results across various tasks with minimal resources (Jia et al 2022).…”
Section: Prompt Learningmentioning
confidence: 99%