“…The calibration block is a 3D 1×1×1 point-wise convolution that resolves feature dimension mismatches; thus, all subsequent blocks have a stride of 1. The numbers of searchable blocks and the strides of the calibration blocks in the six layers are [4,4,4,4,4,1] and [2,2,2,1,2,1], respectively. The stem block has 32 output channels, and the six layers have [24,40,80,96,192,320] output channels, respectively.…”
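
The per-layer figures quoted above can be tabulated with a small sketch, which may make the layout easier to check. This is only an illustration in plain Python, not the authors' implementation: the names `LayerSpec` and `build_layer_specs` are hypothetical, each stride is treated as a single scalar factor (the excerpt does not say which of the three dimensions it applies to), and the cumulative stride is measured relative to the stem output because the stem's own stride is not given.

```python
from dataclasses import dataclass

# Values quoted in the excerpt.
SEARCHABLE_BLOCKS = [4, 4, 4, 4, 4, 1]
CALIBRATION_STRIDES = [2, 2, 2, 1, 2, 1]
LAYER_CHANNELS = [24, 40, 80, 96, 192, 320]
STEM_CHANNELS = 32


@dataclass
class LayerSpec:
    """Hypothetical container for one layer's configuration."""
    in_channels: int         # channels entering the calibration block
    out_channels: int        # channels produced by the layer
    calibration_stride: int  # stride of the 1x1x1 calibration block
    searchable_blocks: int   # number of searchable blocks (all stride 1)
    cumulative_stride: int   # downsampling relative to the stem output (assumption)


def build_layer_specs():
    """Chain the six layers so each layer's input is the previous output."""
    specs = []
    in_ch = STEM_CHANNELS
    cumulative = 1
    for n_blocks, stride, out_ch in zip(
        SEARCHABLE_BLOCKS, CALIBRATION_STRIDES, LAYER_CHANNELS
    ):
        # Only the calibration block changes resolution within a layer.
        cumulative *= stride
        specs.append(LayerSpec(in_ch, out_ch, stride, n_blocks, cumulative))
        in_ch = out_ch
    return specs


if __name__ == "__main__":
    for index, spec in enumerate(build_layer_specs(), start=1):
        print(f"layer {index}: {spec}")
```

Under these assumptions, the six layers contain 21 searchable blocks in total, and the output of the last layer is downsampled by a factor of 16 relative to the stem output.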