“…In settings where the dataset has clear task boundaries, task-level routing has been applied to multi-task learning [Ponti et al., 2022; Wang et al., 2023a; Caccia et al., 2023]. The routing mechanism takes the form $w_\tau = f(t(x_i))$, meaning that inputs $x$ affiliated with the same task $\tau_i$ share a common set of routing parameters $w_\tau$.…”
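
As a minimal sketch of the task-level routing described above, the following Python snippet looks up routing weights from the task identity alone, so every input from the same task is mixed over the shared modules in the same way. The excerpt does not specify the form of $f$ or of the modules; the softmax over a per-task logit table, the names `ROUTING_LOGITS`, `MODULES`, `routing_weights`, and `route`, and all numeric values are assumptions made purely for illustration. The task label is passed in explicitly to stand in for $t(x_i)$.

```python
import math

# Hypothetical per-task routing logits: one row per task, one entry per module.
ROUTING_LOGITS = {
    "task_a": [2.0, 0.0, 0.0],
    "task_b": [0.0, 0.0, 2.0],
}

# Hypothetical shared modules; in practice these would be adapters or experts.
MODULES = [
    lambda v: v + 1.0,
    lambda v: v * 2.0,
    lambda v: -v,
]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def routing_weights(task_id):
    """w_tau = f(t(x)): the weights depend only on the task id, never on x."""
    # Assumption: f is a softmax over that task's logits.
    return softmax(ROUTING_LOGITS[task_id])


def route(x, task_id):
    """Combine the module outputs for input x using its task's routing weights."""
    weights = routing_weights(task_id)
    return sum(w * module(x) for w, module in zip(weights, MODULES))
```

Calling `route(1.0, "task_a")` and `route(5.0, "task_a")` uses identical routing weights, whereas `route(1.0, "task_b")` uses a different mixture. This is the sharing property that the formula expresses.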