“…However, the classical CNNs have proven to be impractical for hardware implementation. A number of processor‐per‐pixel array architectures, designed specifically for asynchronous data‐flow processing, have been presented in the literature, including the work on grey‐scale morphology 13, segmentation 14, skeletonization 18 and rank‐order filtering 19. However, the algorithm‐specific designs not only have a limited functionality, but also occupy relatively large silicon area, which is the major drawback when used as a co‐processor for a general‐purpose cellular processor array.…”