Systematic assessment of the replicability and generalizability of preclinical findings: Impact of protocol harmonization across laboratory sites

Arroyo-Araujo, María; Voelkl, Bernhard; Laloux, Clément; Novak, Janja; Koopmans, Bastijn; Waldron, Ann-Marie; Seiffert, Isabel; Stirling, Helen; Aulehner, Katharina; Janhunen, Sanna; Ramboz, Sylvie; Potschka, Heidrun; Holappa, Johanna; Fine, Tania; Loos, Maarten; Boulanger, Bruno; Würbel, Hanno; Kas, Martien

doi:10.1371/journal.pbio.3001886

Cited by 9 publications

(4 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Alternatively, labs developed a general protocol but adapted it to fit their own respective budget with what resources they had. Of note, recent work has suggested that harmonization reduces between-lab variability, however, systematic heterogenization did not reduce variability further ( Arroyo-Araujo et al, 2022 ); this may suggest that, even in fully harmonized protocols, enough uncontrolled heterogeneity exists that further purposeful heterogenization has little effect. Another barrier identified was ethics approval for animal experiments at all the labs ( Llovera et al, 2015 ).…”

Section: Resultsmentioning

confidence: 99%

A systematic assessment of preclinical multilaboratory studies and a comparison to single laboratory studies

Hunniford

Grudniewicz

Fergusson

et al. 2023

eLife

View full text Add to dashboard Cite

Background: Multicentric approaches are widely used in clinical trials to assess generalizability of findings, however they are novel in laboratory-based experimentation. It is unclear how multilaboratory studies may differ in conduct and results from single lab studies. Here we synthesized characteristics of these studies and quantitatively compared their outcomes to those generated by single laboratory studies.Methods: MEDLINE and Embase were systematically searched. Screening and data extractions were completed in duplicate by independent reviewers. Multilaboratory studies investigating interventions using in vivo animal models were included. Study characteristics were extracted. Systematic searches were then performed to identify single center studies matched by intervention and disease. Difference in standardized mean differences (DSMD) was then calculated across studies to assess differences in effect estimates based on study design (>0 indicates larger effects in single center studies).Results: Sixteen multilaboratory studies met inclusion criteria and were matched to 100 single center studies. The multicenter study design was applied across a diverse range of diseases, including traumatic brain injury, myocardial infarction, and diabetes. The median number of centers was 4 (range 2-6) and the median sample size was 111 (range 23-384) with rodents most frequently used. Multicenter studies adhered to practices that reduce risk of bias significantly more often than single center studies. Multicenter studies also demonstrated significantly smaller effect sizes than single center studies (DSMD 0.72 [95% confidence interval 0.43-1]).Conclusion: Multilaboratory studies demonstrate trends that have been well recognized in clinical research (i.e. smaller treatment effects with multicentric evaluation and greater rigour in study design). This approach may provide a method to robustly assess interventions and generalizability of findings between laboratories.Funding: uOttawa Junior Clinical Research Chair; The Ottawa Hospital Anesthesia Alternate Funds Association; Canadian Anesthesia Research Foundation; Government of Ontario Queen Elizabeth II Graduate Scholarship in Science and Technology.Clinical trial registration: PROSPERO CRD4201809398.

show abstract

Section: Resultsmentioning

confidence: 99%

A systematic assessment of preclinical multilaboratory studies and a comparison to single laboratory studies

Hunniford

Grudniewicz

Fergusson

et al. 2023

eLife

View full text Add to dashboard Cite

show abstract

“…The multinational EQIPD consortium ( E nhanced Q uality in P reclinical D ata, https://quality-preclinical-data.eu/) aimed to identify factors influencing the quality of data generated in preclinical research as a basis for recommendations enabling a smoother and successful transition from preclinical research to clinical application (9, 10). In a recent study, the consortium analyzed the impact of protocol harmonization on the replicability of data in the open field test, i.e., a behavioral paradigm with automated recording of the primary outcome parameters (11). The study demonstrated that harmonization of protocols can reduce between-site variability (11).…”

Section: Introductionmentioning

confidence: 99%

“…In a recent study, the consortium analyzed the impact of protocol harmonization on the replicability of data in the open field test, i.e., a behavioral paradigm with automated recording of the primary outcome parameters (11). The study demonstrated that harmonization of protocols can reduce between-site variability (11). Considering that the standardized application of scoring systems and their harmonization across sites can pose a particular challenge, we next addressed the question of whether a comparable impact can be observed for a frequently used paradigm, based on scoring of a variety of readout parameters by experimenters.…”

Section: Introductionmentioning

confidence: 99%

“…To allow comparability with two other subprojects of the EQIPD consortium that focused on an open field paradigm (11) and pharmaco-EEG, the study was performed in mice, representing the second most common species for Irwin/FOB testing (16, 24). MK-801 was chosen as the compound, considering its broad range of effects on various parameters from different functional domains assessed in the Irwin test.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Systematic Assessment of Robustness in CNS Safety Pharmacology

Reiber,

Stirling,

Ahuis

et al. 2024

Preprint

Self Cite

View full text Add to dashboard Cite

Irwin tests are key preclinical study elements for characterizing drug-induced neurological side effects. This multicenter study aimed to assess the robustness of Irwin tests across multinational sites during three stages of protocol harmonization. The projects were part of the EQIPD framework (Enhanced Quality in Preclinical Data, https://quality-preclinical-data.eu/), aiming to increase success rates in transition from preclinical testing to clinical application. Female and male NMRI mice were assigned to one of three groups (vehicle, 0.1 mg/kg MK-801, 0.3 mg/kg MK-801). Irwin scores were assessed at baseline and multiple times following injection of MK-801, a non-competitive NMDA antagonist, using local protocols (stage 1), a shared protocol with harmonized environmental design (stage 2), and fully harmonized Irwin scoring protocols (stage 3). The analysis based on the four functional domains (motor, autonomic, sedation, and excitation) revealed substantial data variability in stages 1 and 2. Although there was still marked overall heterogeneity between sites in stage 3 after complete harmonization of the Irwin scoring scheme, heterogeneity was only moderate within functional domains. When comparing treatment groups vs. vehicle, we found large effect sizes in the motor domain and subtle to moderate effects in the excitation-related and autonomic domain. The pronounced interlaboratory variability in Irwin datasets for the CNS-active compound MK-801 needs to be carefully considered by companies and experimenters when making decisions during drug development. While environmental and general study design had a minor impact, the study suggests that harmonization of parameters and their scoring can limit variability and increase robustness.

show abstract

Good Practice Guideline for Preclinical Alcohol Research: The STRINGENCY Framework

Meinhardt,

Gerlach,

Spanagel

2024

Current Topics in Behavioral Neurosciences

View full text Add to dashboard Cite

Systematic assessment of the replicability and generalizability of preclinical findings: Impact of protocol harmonization across laboratory sites

Cited by 9 publications

References 26 publications

A systematic assessment of preclinical multilaboratory studies and a comparison to single laboratory studies

A systematic assessment of preclinical multilaboratory studies and a comparison to single laboratory studies

A Systematic Assessment of Robustness in CNS Safety Pharmacology

Good Practice Guideline for Preclinical Alcohol Research: The STRINGENCY Framework

Contact Info

Product

Resources

About