Practical applications of nonparametric density estimators in more than three dimensions suffer a great deal from the well-known curse of dimensionality: convergence slows down as dimension increases. We show that one can evade the curse of dimensionality by assuming a simplified vine copula model for the dependence between variables. We formulate a general nonparametric estimator for such a model and show under high-level assumptions that the speed of convergence is independent of dimension. We further discuss a particular implementation for which we validate the high-level assumptions and establish its asymptotic normality. Simulation experiments illustrate a large gain in finite sample performance when the simplifying assumption is at least approximately true. But even when it is severely violated, the vine copula based approach proves advantageous as soon as more than a few variables are involved. Lastly, we give an application of the estimator to a classification problem from astrophysics.
In the last decade, simplified vine copula models have been an active area of research. They build a high dimensional probability density from the product of marginals densities and bivariate copula densities. Besides parametric models, several approaches to nonparametric estimation of vine copulas have been proposed. In this article, we extend these approaches and compare them in an extensive simulation study and a real data application. We identify several factors driving the relative performance of the estimators. The most important one is the strength of dependence. No method was found to be uniformly better than all others. Overall, the kernel estimators performed best, but do worse than penalized B-spline estimators when there is weak dependence and no tail dependence.
We describe the R package kdecopula (current version 0.9.2), which provides fast implementations of various kernel estimators for the copula density. Due to a variety of available plotting options it is particularly useful for the exploratory analysis of dependence structures. It can be further used for accurate nonparametric estimation of copula densities and resampling. The implementation features spline interpolation of the estimates to allow for fast evaluation of density estimates and integrals thereof. We utilize this for a fast renormalization scheme that ensures that estimates are bona fide copula densities and additionally improves the estimators' accuracy. The performance of the methods is illustrated by simulations.
Vine copulas allow to build flexible dependence models for an arbitrary number of variables using only bivariate building blocks. The number of parameters in a vine copula model increases quadratically with the dimension, which poses new challenges in high-dimensional applications. To alleviate the computational burden and risk of overfitting, we propose a modified Bayesian information criterion (BIC) tailored to sparse vine copula models. We show that the criterion can consistently distinguish between the true and alternative models under less stringent conditions than the classical BIC. The new criterion can be used to select the hyper-parameters of sparse model classes, such as truncated and thresholded vine copulas. We propose a computationally efficient implementation and illustrate the benefits of the new concepts with a case study where we model the dependence in a large stock stock portfolio.
Statistically simulated time series of wave parameters are required for many coastal and offshore engineering applications, often at the resolution of approximately one hour. Various studies have relied on autoregressive moving-average (ARMA) processes to simulate synthetic series of wave parameters in a Monte Carlo sense. However, accurately representing inter-series dependencies has remained a challenge. In particular, the relationship between wave height and period statistics is complex, due to the limiting steepness condition. Here, we present a new simulation method for joint time series of significant wave height, mean zero-crossing periods and a directional regime variable. The latter distinguishes between northern and southwestern waves. The method rests on several model components which include renewal processes, Fourier series with random coefficients, ARMA processes, copulas and regime-switching. A particular feature is a data-driven estimate for a wave height-dependent limiting wave steepness condition which is used to facilitate copulabased dependence modeling. The method was developed for and applied to a data set in the Southern North Sea. For this site, the method could simulate time series with realistic annual cycles and inter-annual variability. In the time series data, the bivariate distribution of significant wave height and mean zero-crossing period was well represented. An influence of the directional regime on the bivariate distribution could also be modeled. However, the influence was not as strong in simulated data as in observed data. Finally, simulated series captured duration and inter-arrival time of storm events well. Potential applications for output of the simulation method range from the assessment of coastal risks or design of coastal structures to the planning and budgeting of offshore operations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.