“…In regression-based methods, the geometry of text is predicted directly from convolutional features [2, 11, 12, 17, 19, 22, 23, 42, 50-52, 56, 77] or RoI features [25, 37, 55, 72], and the predictions are then decoded into final results relative to given reference points or boxes. In instance-segmentation-based methods, typically Mask R-CNN-based methods [28, 33, 43, 57, 59, 60], an extra branch is added to a detection framework. The results are obtained via instance segmentation, which avoids the learning-target confusion problem [26, 61] that exists in regression-based methods.…”
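
To illustrate what decoding relative to a reference box can look like, here is a minimal sketch in Python. It assumes the common R-CNN-style parameterization (center shifts scaled by the reference size, log-scaled width and height); the excerpt does not say which parameterization any cited method uses, and the function name `decode_box` is hypothetical.

```python
import math


def decode_box(ref, deltas):
    """Decode (dx, dy, dw, dh) offsets relative to a reference box.

    ref is (x1, y1, x2, y2); the decoded box is returned in the same format.
    Assumption: R-CNN-style offsets, chosen for illustration only.
    """
    x1, y1, x2, y2 = ref
    dx, dy, dw, dh = deltas
    # Reference box size and center.
    w, h = x2 - x1, y2 - y1
    cx, cy = x1 + 0.5 * w, y1 + 0.5 * h
    # Center shifts are scaled by the reference size; sizes are log-scaled.
    pcx, pcy = cx + dx * w, cy + dy * h
    pw, ph = w * math.exp(dw), h * math.exp(dh)
    return (pcx - 0.5 * pw, pcy - 0.5 * ph, pcx + 0.5 * pw, pcy + 0.5 * ph)


ref = (10.0, 20.0, 50.0, 40.0)
# Zero offsets reproduce the reference box itself.
print(decode_box(ref, (0.0, 0.0, 0.0, 0.0)))
```

Under this parameterization the network regresses four numbers per reference box, and the same decoding step maps them back to image coordinates at inference time. Text detectors that predict quadrilaterals or rotated boxes would need a different, method-specific decoding.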