“…A few works also examine continual learning from the perspectives of the loss landscape and optimization dynamics [Mirzadeh et al., 2020, Mirzadeh et al., 2020b]. Modularity-based methods allocate different subsets of the parameters to each task [Rusu et al., 2016, Yoon et al., 2018, Jerfel et al., 2019, Li et al., 2019, Wortsman et al., 2020, Mirzadeh et al., 2020a].…”