“…In recent studies, the number of experiments necessary to identify a high-performing material has been used as a metric for monitoring SL performance. 8,16,33 Modeling benchmark datasets and tools, such as Olympus, 34 MatBench, 35 and DiSCoVeR, 36 have started to standardize assessment of model and dataset performance. Notably, a recent study by Rohr et al. 37 considers additional metrics that quantify SL performance relative to a benchmark case (typically random search).…”
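
As a rough illustration of the experiment-count metric mentioned in the excerpt, the sketch below counts how many experiments a campaign needed before it reached a high-performing material, then compares that count with the expected count for uniform random search without replacement, which is (N + 1) / (k + 1) for N candidates of which k are targets. All function names, the material labels, and the ratio itself are assumptions made for illustration; this is not necessarily how the cited studies define their metrics.

```python
from typing import Sequence, Set


def experiments_to_first_hit(order: Sequence[str], targets: Set[str]) -> int:
    """Count experiments run up to and including the first target material."""
    for count, candidate in enumerate(order, start=1):
        if candidate in targets:
            return count
    raise ValueError("no target material appears in the experiment order")


def expected_random_experiments(n_candidates: int, n_targets: int) -> float:
    """Expected number of uniform random draws (without replacement)
    needed to hit any one of n_targets among n_candidates."""
    if not 0 < n_targets <= n_candidates:
        raise ValueError("need 0 < n_targets <= n_candidates")
    return (n_candidates + 1) / (n_targets + 1)


def acceleration_over_random(
    order: Sequence[str], targets: Set[str], n_candidates: int
) -> float:
    """Hypothetical relative metric: expected random-search count divided by
    the observed count. Values above 1 mean fewer experiments than random."""
    baseline = expected_random_experiments(n_candidates, len(targets))
    return baseline / experiments_to_first_hit(order, targets)


# Hypothetical campaign: 20 candidate materials, 2 of them high-performing.
campaign_order = ["m07", "m02", "m11", "m04"]
high_performers = {"m11", "m19"}
speedup = acceleration_over_random(campaign_order, high_performers, n_candidates=20)
print(f"acceleration over random search: {speedup:.2f}")
```

In this made-up campaign the first high performer is reached on the third experiment, while random search would need 7 experiments on average, so the ratio is about 2.33.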