Background: The selection of patients with large-vessel occlusion (LVO) stroke for endovascular treatment (EVT) depends on patient characteristics and procedural metrics. The relation of these variables to functional outcome after EVT has been assessed in numerous datasets from both randomized controlled trials (RCT) and real-world registries, but whether differences in their case mix modulate outcome prediction is unknown. Methods: We leveraged data from individual patients with anterior LVO stroke treated with EVT from completed RCTs from the Virtual International Stroke Trials Archive ( N = 479) and from the German Stroke Registry ( N = 4079). Cohorts were compared regarding (i) patient characteristics and procedural pre-EVT metrics, (ii) these variables’ relation to functional outcome, and (iii) the performance of derived outcome prediction models. Relation to outcome (functional dependence defined by a modified Rankin Scale score of 3–6 at 90 days) was analyzed by logistic regression models and a machine learning algorithm. Results: Ten out of 11 analyzed baseline variables differed between the RCT and real-world cohort: RCT patients were younger, had higher admission NIHSS scores, and received thrombolysis more often (all p < 0.0001). Largest differences at the level of individual outcome predictors were observed for age (RCT: adjusted odds ratio (aOR), 1.29 (95% CI, 1.10–1.53) vs real-world aOR, 1.65 (95% CI, 1.54–1.78) per 10-year increments, p < 0.001). Treatment with intravenous thrombolysis was not significantly associated with functional outcome in the RCT cohort (aOR, 1.64 (95 % CI, 0.91–3.00)), but in the real-world cohort (aOR, 0.81 (95% CI, 0.69–0.96); p for cohort heterogeneity = 0.056). Outcome prediction was more accurate when constructing and testing the model using real-world data compared to construction with RCT data and testing on real-world data (area under the curve, 0.82 (95% CI, 0.79–0.85) vs 0.79 (95% CI, 0.77–0.80), p = 0.004). Conclusions: RCT and real-world cohorts considerably differ in patient characteristics, individual outcome predictor strength, and overall outcome prediction model performance.