What Matters For Meta-Learning Vision Regression Tasks?

Gao, Ning; Ziesche, Hanna; Vien, Ngo Anh; Volpp, Michael; Neumann, Gerhard

doi:10.1109/cvpr52688.2022.01436

Cited by 11 publications

(13 citation statements)

References 44 publications


“…Dataset and Experimental Setups. To evaluate task generalization in meta-learning settings, we use two rotation prediction datasets named as ShapeNet1D [18] and Pascal1D [71,79], the goal of both datasets is to predict an object's rotation relative to the canonical orientation. Each task is rotation regression for one object, where the model takes a 128×128 grey-scale image as the input, and the output is an azimuth angle normalized between [0, 10].…”

Section: Task Generalization

mentioning

confidence: 99%

“…By mixing query examples with support examples that have similar labels, C-Mixup outperforms all of the other approaches on both datasets, verifying its effectiveness of improving task generalization. [18], the goal of RCF-MNIST is to predict the angle of rotation for each object. As shown in Figure 3, we color each image with a color between red and blue.…”

Section: Task Generalization

mentioning

confidence: 99%

“…Directly applying mixup to input features and labels in regression tasks may yield arbitrarily incorrect labels. For example, as shown in Figure 1(a), ShapeNet1D pose prediction [18] aims to predict the current orientation of the object relative to its canonical orientation. We randomly select three mixing pairs and show the mixed images and labels in Figure 1(b), where only pair 1 exhibits reasonable mixing results.…”

Section: Introduction

mentioning

confidence: 99%

“…Similar to synthetic datasets with subpopulation shifts in classification, e.g., ColoredMNIST [4], we build a synthetic regression dataset, RCFashion-MNIST (RCF-MNIST), which is illustrated in Figure 3. Built on the FashionMNIST [18], the goal of RCF-MNIST is to predict the angle of rotation for each object. As shown in Figure 3, we color each image with a color between red and blue.…”

mentioning

confidence: 99%

“…We compute the mean and standard deviation for results of three seeds. We adopt the same preprocessing strategy to preprocess the ShapeNet1D dataset [18]. The ShapeNet1D dataset contains 27 categories with 60 objects per category.…”

mentioning

confidence: 99%


C-Mixup: Improving Generalization in Regression

Yao, Wang, Zhang et al. 2022

Preprint


Improving the generalization of deep networks is an important open challenge, particularly in domains without plentiful data. The mixup algorithm improves generalization by linearly interpolating a pair of examples and their corresponding labels. These interpolated examples augment the original training set. Mixup has shown promising results in various classification tasks, but systematic analysis of mixup in regression remains underexplored. Using mixup directly on regression labels can result in arbitrarily incorrect labels. In this paper, we propose a simple yet powerful algorithm, C-Mixup, to improve generalization on regression tasks. In contrast with vanilla mixup, which picks training examples for mixing with uniform probability, C-Mixup adjusts the sampling probability based on the similarity of the labels. Our theoretical analysis confirms that C-Mixup with label similarity obtains a smaller mean square error in supervised regression and meta-regression than vanilla mixup and using feature similarity. Another benefit of C-Mixup is that it can improve out-of-distribution robustness, where the test distribution is different from the training distribution. By selectively interpolating examples with similar labels, it mitigates the effects of domain-associated information and yields domain-invariant representations. We evaluate C-Mixup on eleven datasets, ranging from tabular to video data. Compared to the best prior approach, C-Mixup achieves 6.56%, 4.76%, and 5.82% improvements in in-distribution generalization, task generalization, and out-of-distribution robustness, respectively. Code is released at https://github.com/huaxiuyao/C-Mixup.

* Equal contribution. This work was done when Yiping Wang was remotely co-mentored by Huaxiu Yao and Linjun Zhang.

36th Conference on Neural Information Processing Systems (NeurIPS 2022).
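The sampling idea described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' released code: the function names `label_similarity_probs` and `c_mixup_batch`, the Gaussian kernel on squared label distance, the `bandwidth` and `alpha` defaults, and the choice to draw one mixing coefficient per batch are all assumptions made for the example.

```python
import numpy as np


def label_similarity_probs(y, bandwidth=1.0):
    """Return a row-normalized matrix P where P[i, j] is the probability
    of choosing example j as the mixing partner of example i."""
    y = np.asarray(y, dtype=float).reshape(len(y), -1)
    # Squared Euclidean distance between every pair of labels.
    sq_dist = ((y[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
    # Assumed Gaussian kernel: examples with closer labels get larger weight.
    weights = np.exp(-sq_dist / (2.0 * bandwidth ** 2))
    return weights / weights.sum(axis=1, keepdims=True)


def c_mixup_batch(x, y, alpha=2.0, bandwidth=1.0, rng=None):
    """Mix each example with a partner sampled by label similarity."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    probs = label_similarity_probs(y, bandwidth)
    n = len(x)
    # Vanilla mixup would draw partners uniformly; here each row of `probs`
    # biases the draw toward similar labels (self-pairing is allowed).
    partners = np.array([rng.choice(n, p=probs[i]) for i in range(n)])
    # One interpolation coefficient for the whole batch (a simplification).
    lam = rng.beta(alpha, alpha)
    x_mix = lam * x + (1.0 - lam) * x[partners]
    y_mix = lam * y + (1.0 - lam) * y[partners]
    return x_mix, y_mix
```

A smaller `bandwidth` concentrates the sampling on near-identical labels, while a very large one approaches uniform sampling, which recovers vanilla mixup in this sketch.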
