“…In contrast, structured pruning is more friendly and efficient on various off-the-shelf deployment platforms, simultaneously speeding up network inference and reducing the memory overhead of CNNs. It can be further categorized into greedy-based pruning [25], [27], [30], [44], [52], [58], [60], [81], search-based pruning [17], [26], [54], [57], dynamic pruning [7], [12], [47], [63], [73], [76], and sparsity regularization-based pruning [31], [42], [45], [46], [51], [53], [56], [75], [78], [80], [84].…”