Predictive mean matching (PMM) is a standard technique for the imputation of incomplete continuous data. PMM imputes an actual observed value, whose predicted value is among a set of k ≥ 1 values (the so-called donor pool), which are closest to the one predicted for the missing case. PMM is usually better able to preserve the original distribution of the empirical data than fully parametric multiple imputation (MI) approaches, when empirical data deviate from their distributional assumptions. Use of PMM is therefore especially worthwhile in situations where model assumptions of fully parametric MI procedures are violated and where fully parametric procedures would yield highly implausible estimates. Unfortunately, today there are only a handful of studies that systematically tested the robustness of PMM and it is still widely unknown where exactly the limits of this procedure lie. I examined the performance of PMM in situations where data were skewed to varying degrees, under different sample sizes, missing data percentages, and using different settings of the PMM approach. It was found that small donor pools overall yielded better results than large donor pools and that PMM generally worked well, unless data were highly skewed and more than about 20% to 30% of the data had to be imputed. Also, PMM generally performed better when sample size was sufficiently large.
Background
The latest version of the International Classification of Diseases (ICD‐11) proposes a posttraumatic stress disorder (PTSD) diagnosis reduced to its core symptoms within the symptom clusters re‐experiencing, avoidance and hyperarousal. Since children and adolescents often show a variety of internalizing and externalizing symptoms in the aftermath of traumatic events, the question arises whether such a conceptualization of the PTSD diagnosis is supported in children and adolescents. Furthermore, although dysfunctional posttraumatic cognitions (PTCs) appear to play an important role in the development and persistence of PTSD in children and adolescents, their function within diagnostic frameworks requires clarification.
Methods
We compiled a large international data set of 2,313 children and adolescents aged 6 to 18 years exposed to trauma and calculated a network model including dysfunctional PTCs, PTSD core symptoms and depression symptoms. Central items and relations between constructs were investigated.
Results
The PTSD re‐experiencing symptoms strong or overwhelming emotions and strong physical sensations and the depression symptom difficulty concentrating emerged as most central. Items from the same construct were more strongly connected with each other than with items from the other constructs. Dysfunctional PTCs were not more strongly connected to core PTSD symptoms than to depression symptoms.
Conclusions
Our findings provide support that a PTSD diagnosis reduced to its core symptoms could help to disentangle PTSD, depression and dysfunctional PTCs. Using longitudinal data and complementing between‐subject with within‐subject analyses might provide further insight into the relationship between dysfunctional PTCs, PTSD and depression.
Abstract. Predictive mean matching (PMM) is a state-of-the-art hot deck multiple imputation (MI) procedure. The quality of its results depends, inter alia, on the availability of suitable donor cases. Applying PMM in small sample scenarios often found in psychological or medical research could be problematic, as there might not be many (or any) suitable donor cases in the data set. So far, there has not been any systematic research that examined the performance of PMM, when sample size is small. The present study evaluated PMM in various multiple regression scenarios, where sample size, missing data percentages, the size of the regression coefficients, and PMM’s donor selection strategy were systematically varied. Results show that PMM could be used in most scenarios, however results depended on the donor selection strategy: overall, PMM using either automatic distance-aided selection of donors ( Gaffert, Meinfelder, & Bosch, 2016 ) or using the nearest neighbor produced the best results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.