Objective: Recognized as a premier approach for adverse event (AE) detection, trigger tools have been developed for multiple clinical settings outside the emergency department (ED). We recently derived and tested an ED trigger tool (EDTT) with enhanced features for high-yield detection of harm, consisting of 30 triggers associated with AEs. In this study, we validate the EDTT in an independent sample and compare record selection approaches to optimize yield for quality improvement. Methods: This is a retrospective observational study using data from 13 months of visits to an urban, academic ED by patients aged ≥ 18 years (92,859 records). We conducted standard two-tiered trigger tool reviews on an independent validation sample of 3,724 records with at least one of the 30 triggers found associated with AEs in our previous derivation sample (N = 1,786). We also tested three new candidate triggers and reviewed 72 records with no triggers for comparison purposes. We compare derivation and validation samples on: 1) triggers showing persistent associations with AEs, 2) AE yield (AEs detected/records reviewed), and 3) representativeness of AE types detected. We use bivariate associations of triggers with AEs as the basis for trigger selection. We then use multivariable modeling in the combined derivation and validation samples to determine AE risk scores using trigger weights. This allows us to predict occurrence of AEs and derive population prevalence estimates. Finally, we compare yield for detection of AEs under three record selection strategies (random selection, trigger counts, weighted trigger counts). Results: Twenty-four of the 30 triggers were confirmed to be associated with AEs on bivariate testing. Three previously marginal triggers and two of three new candidate triggers were also found to be associated with AEs. The presence of any of these 29 triggers was associated with an AE rate of 10% in our selected sample (compared to 1.1% for none, p < 0.001). The risk of an AE increased with number of triggers. Combining data from both phases, we identified 461 AEs in 429 unique visits in 5,582 records reviewed. Our multivariable model (which emphasized parsimony) retained 12 triggers with a ROC AUC of 82% in both samples. Selecting records for review based on number of triggers improves yield to 14% for 4+ triggers (top 10% of visits) and to 28% for 8+ (top 1%). A weighted trigger count has corresponding yields of 18 and 38%. The method for selecting records for review did not appear to affect event-type representativeness, with similar distributions of event types and severities detected. Conclusions: In this single-site study of the EDTT we observed high levels of validity in trigger selection, yield, and representativeness of AEs, with yields that are superior to estimates for traditional approaches to AE detection. Record selection using weighted triggers outperforms a trigger count threshold approach and far outperforms random sampling from records with at least one trigger. The EDTT is a promising efficient ...