“…For CI-FAR(Krizhevsky and Hinton 2009) benchmarks, CIFAR10 and CIFAR100 were respectively used as the ID datasets, and six datasets were used as OOD test sets, including Textures(Cimpoi et al 2014), SVHN(Netzer et al 2011), iSUN(Xu et al 2015), Places365(Zhou et al 2017), LSUN-C(Yu et al 2015), and LSUN-R(Yu et al 2015). For largescale ImageNet benchmarks, two different sets of 100 Im-ageNet(Deng et al 2009) classes, namely ImageNet100-I and ImageNet100-II(Tao et al 2023), were used as ID sets considering that both sets have been used in related literature, and four OOD test datasets, Places(Zhou et al 2017), Textures, iNaturalist (Van Horn et al 2018, and SUN(Xiao et al 2010) were used for evaluation. There are no overlapped classes between OOD datasets and corresponding ID datasets.…”