“…There is a long tradition of leveraging computer vision techniques for aerial scene parsing. Earlier works (Liu, Liu, 2014, Blaschke et al, 2004, Predoehl et al, 2013) mainly focused on exploring effective visual features and semantic modeling approaches. Recently, deep CNNs have been widely explored in this field and have driven major advances (Marcos et al, 2018a, Sun et al, 2019, Azimi et al, 2019, Kellenberger et al, 2019, Cheng et al, 2019, Mou et al, 2019).…”