“…Although measuring the distribution of plausible effect sizes can provide important insights about the robustness of an empirical result (e.g., Coretta et al, 2023;Kümpel & Hoffmann, 2022), we argue that it is incomplete (see also, Mathur et al, 2023;Young & Holsteen, 2017). To reap the full benefits of involving multiple analysts, we should also examine the broader context in which analysts made their choices: their prior beliefs about 1 Alternative approaches for synthesizing outcomes in many-analysts projects (e.g., considering only the sign of the effect size; focusing on evidential measures such as p-values or Bayes factors) do not seem satisfactory, especially when quantifying the size of the effect is essential (see e.g., Mathur et al, 2023).…”