HIDRA: Head Initialization across Dynamic targets for Robust Architectures

Drumond, Rafael Rêgo; Brinkmeyer, Lukas; Grabocka, Josif; Schmidt-Thieme, Lars

doi:10.1137/1.9781611976236.45

Cited by 3 publications

(1 citation statement)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…One of the first methods to attempt few-shot learning on homogeneous predictors was chameleon [4] which used a convolutional encoder to align tasks from similar domains to a common attribute space before utilizing gradient-based few-shot methods. Similarly, other works tried to learn across tasks with a varied label spaces [9,32]. Finally, Iwata et al [19] proposed a model that uses deep set [53] based blocks to compute a task-embedding over predictor and targets of training samples (support data) which then can be combined with new unlabeled samples (query data) to perform a classification or regression without the need of retraining or fine-tuning, similar to neighbor-based approaches (we will refer to this method as HetNet throughout the rest of the paper).…”

Section: Related Workmentioning

confidence: 99%

Few-Shot Forecasting of Time-Series with Heterogeneous Channels

Brinkmeyer¹,

Drumond²,

Burchert³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Learning complex time series forecasting models usually requires a large amount of data, as each model is trained from scratch for each task/data set. Leveraging learning experience with similar datasets is a well-established technique for classification problems called few-shot classification. However, existing approaches cannot be applied to timeseries forecasting because i) multivariate time-series datasets have different channels and ii) forecasting is principally different from classification. In this paper we formalize the problem of few-shot forecasting of timeseries with heterogeneous channels for the first time. Extending recent work on heterogeneous attributes in vector data, we develop a model composed of permutation-invariant deep set-blocks which incorporate a temporal embedding. We assemble the first meta-dataset of 40 multivariate time-series datasets and show through experiments that our model provides a good generalization, outperforming baselines carried over from simpler scenarios that either fail to learn across tasks or miss temporal information.

show abstract

Section: Related Workmentioning

confidence: 99%