“…Large-scale perturbation datasets display a high rate of success in targeting individual genes, modest correlation across replicates, and mostly small global effects Table 1 Figure S1 Figure 2 Regression-based analyses do not out-perform uninformed baselines Figure 3 Figure S2 Table 4 Diverse published methods do not out-perform simple baselines Figure 5 A grammar of gene regulatory network (GRN) inference synthesizes diverse methods withinIntroduction Systematically determining causal effects of transcription factor (TF) activity is a fundamental problem in systems biology, with early studies even prior to whole-genome expression profiling (Akutsu, Miyano, & Kuhara, 1999;Duggan, Bittner, Chen, Meltzer, & Trent, 1999;Liang, Fuhrman, & Somogyi, 1998). Driven by new datasets, a raft of recent computational methods now predict genetic perturbation results in silico, with applications in embryology, stem cell reprogramming, and drug discovery (Amrute et al, 2022;Burdziak et al, 2023;Cui, Wang, Maan, & Wang, 2023;Hyttinen, Eberhardt, & Hoyer, 2012;Jiang et al, 2023;Kamimoto et al, 2023;Osorio et al, 2022;Qiu et al, 2022;Roohani, Huang, & Leskovec, 2022;Theodoris et al, 2023;Tran, Yang, Yang, & Ormerod, 2022;Wang et al, 2022;Yeo, Saksena, & Gifford, 2021). These expression forecasting methods aim to serve as a new type of general-purpose screening tool.…”