Performance-monitoring event-related brain potentials (ERPs), such as the error-related negativity (ERN) and reward positivity (RewP), are advocated as biomarkers of depression symptoms and risk. However, a recent meta-analysis indicated effect size heterogeneity in the ERN and RewP literatures. Hence, advocating these ERPs as biomarkers of depression might be premature or possibly misguided due to the selective reporting of significant analyses on the part of researchers (e.g., p-hacking or omission of non-significant findings). The present study quantified the degree of selective reporting and the evidential value for a true relationship between depression and ERN and RewP using a p-curve analysis. We predicted that the ERN and RewP literatures would fail to show evidential value for a relationship between each ERP and depression. Contrary to expectations, both literatures showed evidential value, albeit weak.The statistical power of the included ERN studies was between 20% and 25%, and the statistical power of the RewP was around 27%. Taken together, these findings provide support for a relationship between these ERPs and depression, which strengthens claims that these ERPs represent candidate biomarkers of depression symptoms and risk. In light of the evidence for these relationships being weak, some recommendations moving forward include conducting a priori power analyses, increasing sample sizes to improve statistical power, assessing the internal consistency of ERP scores, and carefully planning statistical approaches to maximize power.Keywords: performance monitoring, depression, p curve, error-related negativity (ERN), reward positivity (RewP) 3 Performance-monitoring event-related brain potentials (ERPs), such as the error-related negativity (ERN) and reward positivity (RewP), are advocated as biomarkers or endophenotypes of depression symptoms and risk (e.g., Meyer, Bress, Hajcak, & Gibb, 2018;Nelson, Perlman, Klein, Kotov, & Hajcak, 2016). However, recent meta-analytic work indicates that both ERN and RewP literatures are replete with heterogenous effect sizes (Moran, Schroder, Kneip, & Moser, 2017). Although heterogeneous effect sizes might be due to known or unknown moderating factors, it is possible that heterogeneity is due to the selective reporting of significant analyses or undisclosed use of other questionable research practices (QRPs), and this possibility was not examined in the meta-analysis. If effect size heterogeneity is due to selective reporting/omission of non-significant findings or other QRPs, then advocating performancemonitoring ERPs as biomarkers of depression would be misguided or at least premature. The aim of the present registered report was to the quantify the degree of selective reporting and the evidential value for a true relationship between depression and ERN and RewP using a priori registered hypotheses and methods.