Evaluating others’ actions as praiseworthy or blameworthy is a fundamental aspect of human nature. A seminal study published in 2007 suggested that the ability to form social evaluations based on third-party interactions emerges within the first year of life, considerably earlier than previously thought (Hamlin, Wynn, & Bloom, 2007). In this study, infants demonstrated a preference for a character (i.e., a shape with eyes) who helped, over one who hindered, another character who tried but failed to climb a hill. This study sparked a new line of inquiry into infants’ social evaluations; however, numerous attempts to replicate the original findings yielded mixed results, with some reporting effects not reliably different from chance. These failed replications point to at least two possibilities: (1) the original study may have overestimated the true effect size of infants’ preference for helpers, or (2) key methodological or contextual differences from the original study may have compromised the replication attempts. Here we present a pre-registered, closely coordinated, multi-laboratory, standardized study aimed at replicating the helping/hindering finding using a well-controlled video version of the hill show. We intended to (1) provide a precise estimate of the true effect size of infants’ preference for helpers over hinderers, and (2) determine the degree to which infants’ preferences are based on social features of the Helper/Hinderer scenarios. XYZ labs participated in the study yielding a total sample size of XYZ infants between the ages of 5.5 and 10.5 months. Brief summary of results will be added after data collection.