“…[4], [12], [13], [118], [119], [120], [121], [122], [123], [124] (Section 4.3)

Optimization Methods (Section 5)
    Distributionally Robust Optimization (Section 5.1): [1], [125], [126], [127], [128], [129], [130], [131], [132], [133], [134] (Sections 5.1.1–5.1.3); [135], [136], [137], [138] (Section 5.1.4)
    Invariance-Based Optimization (Section 5.2): [5], [139], [140] (Section 5.2)

In real scenarios where observations are made in the form of images or sentences instead of structured data, high-level abstract information needs to be extracted from low-level data [27], and a few existing works [34], [35], [36] propose to recover causal factorization through disentanglement.…”