2021
DOI: 10.1109/access.2021.3096530
|View full text |Cite
|
Sign up to set email alerts
|

Tunable U-Net: Controlling Image-to-Image Outputs Using a Tunable Scalar Value

Abstract: Image-to-image conversion tasks are more accurate and sophisticated than ever thanks to advances in deep learning. However, since typical deep learning models are trained to perform only one task, multiple trained models are required to perform each task even if they are related to each other. For example, the popular image-to-image convolutional neural network, U-Net, is normally trained for a single task. Based on U-Net, this study proposes a model that outputs variable results using only one trained model. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…Subsequently, we explored the strategies to tune the network with incident and azimuthal angles. Following the successful experience of [32], we extended U-Net by the implementation of the tuning network (Figure 3) with trainable weights. Our tuning network brings the tunable scalar parameter to the main U-Net model controlling the second convolutional layer so that the physical loss may simultaneously be minimized for different illumination directions.…”
Section: Training Processmentioning
confidence: 99%
“…Subsequently, we explored the strategies to tune the network with incident and azimuthal angles. Following the successful experience of [32], we extended U-Net by the implementation of the tuning network (Figure 3) with trainable weights. Our tuning network brings the tunable scalar parameter to the main U-Net model controlling the second convolutional layer so that the physical loss may simultaneously be minimized for different illumination directions.…”
Section: Training Processmentioning
confidence: 99%
“…However, the illumination parameters such as illumination angle or exposure wavelength are inherently scalar floating-point numbers, and their representation in an image-based format is not immediately straightforward. The authors [33] explored strategies to continuously change the network output by external scalar parameters. Building upon this successful approach, we extended the U-Net scheme by implementing a tuning network with trainable weights.…”
Section: Training Processmentioning
confidence: 99%