“…Recently, coding theory has provided a popular tool set for mitigating the effects of stragglers. Codes have recently been used in the context of machine learning, and applied to problems such as data shuffling [19,17], distributed matrix-vector and matrix-matrix multiplication [17,10], as well as distributed training [20,32,15,11,22,7,33]. In the context of distributed, gradient-based algorithms, Tandon et al [29] introduced gradient coding as a means to mitigate straggler delays.…”