This paper presents the Plug-Load Appliance Identification Dataset (PLAID), a labelled dataset containing records of the electrical voltage and current of domestic electrical appliances obtained at a high sampling frequency (30 kHz). The dataset contains 1876 records of individually-metered appliances from 17 different appliance types (e.g., refrigerators, microwave ovens, etc.) comprising 330 different makes and models, and collected at 65 different locations in Pittsburgh, Pennsylvania (USA). Additionally, PLAID contains 1314 records of the combined operation of 13 of these appliance types (i.e., measurements obtained when multiple appliances were active simultaneously). Identifying electrical appliances based on electrical measurements is of importance in demand-side management applications for the electrical power grid including automated load control, load scheduling and nonintrusive load monitoring. This paper provides a systematic description of the measurement setup and dataset so that it can be used to develop and benchmark new methods in these and other applications, and so that extensions to it can be developed and incorporated in a consistent manner. 110 130 max 2 0
RMSThe main contributions of this paper are that it:The complete PLAID dataset and all mentioned scripts are available in 5 . In the same repository, code written to capture the data can be found. The files are two scripts, namely 'collecting_data.vi' (written with LabVIEW) and 'collecting_data.m' (written in MATLAB).
Predicting the dependencies between observations from multiple time series is critical for applications such as anomaly detection, financial risk management, causal analysis, or demand forecasting. However, the computational and numerical difficulties of estimating time-varying and high-dimensional covariance matrices often limits existing methods to handling at most a few hundred dimensions or requires making strong assumptions on the dependence between series. We propose to combine an RNN-based time series model with a Gaussian copula process output model with a low-rank covariance structure to reduce the computational complexity and handle non-Gaussian marginal distributions. This permits to drastically reduce the number of parameters and consequently allows the modeling of time-varying correlations of thousands of time series. We show on several realworld datasets that our method provides significant accuracy improvements over state-of-the-art baselines and perform an ablation study analyzing the contributions of the different components of our model.
Shapelets are discriminative subsequences extracted from time-series data. Classifiers using shapelets have proven to achieve performances competitive to state-of-the-art methods, while enhancing the model's interpretability. While a lot of research has been done for univariate time-series shapelets, extensions for the multivariate setting have not yet received much attention. To extend shapelets-based classification to a multidimensional setting, we developed a novel architecture for shapelets learning, by embedding them as trainable weights in a multi-layer Neural Network. We also investigated the introduction of a novel learning strategy for the shapelets, comprising of two additional terms in the optimization goal, to retrieve a reduced set of uncorrelated shapelets. This paper describes the proposed architecture and presents results on ten publicly available benchmark datasets, as well as a comparison with existing stateof-the-art methods. Moreover, the proposed optimization objective leads the model to automatically select smaller sets of uncorrelated shapelets, thus requiring no additional manual optimization on typically important hyper-parameters such as number and length of shapelets. The results show how the proposed approach achieves competitive performance across the datasets, and always leads to a significant reduction in the number of shapelets used. This can make it faster for a domain expert to match shapelets to real patterns, thus enhancing the interpretability of the model. Finally, since the shapelets learnt during training can be extracted from the model they can
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.