“…• We also analyze the impact of the depth of hidden layers in the channel-wise weighting module on both training loss and validation loss. Moreover, in this module, we tackle the information loss due to the global average pooling in (Zhang et al, 2020) by replacing it with GRU.…”