Accelerators for biologically-inspired attention and recognition

Park, Mi Sun; Zhang, Chuanjun; DeBole, Michael; Kestur, Srinidhi; Narayanan, Vijaykrishnan; Irwin, M.J.

doi:10.1145/2463209.2488900

Cited by 7 publications

(4 citation statements)

References 15 publications


“…For example, in [16], the authors have suggested a real time implementation of their proposed saliency based algorithm on a highly parallel Single Instruction Multiple Data (SIMD) architecture called ProtoEye, which consists of a 2D array of mixed analog-digital processing elements (PE). Recent efforts were presented in [17] [18], which propose a parallel implementation of this model with multi-GPU and multi-FPGA system reaching real time performance and good recognition accuracy.…”

Section: Previous Related Work (mentioning)

confidence: 99%

“…The Table III represents the speedups in execution time gained by our pipeline architecture and two existing HMAX accelerators implementations for 256 × 256 grayscale images [10] [18]. The initial design of the HMAX accelerator [10] takes about 21.81 ms per image with a frame rate of 45.85 fps, whereas the second design [18] takes about 11.04 ms per image with a frame rate of 90.57 fps. Our multi-processor architecture gave an overall speedups of 3.14X and 1.52X over the initial design and the second design, although it is mapped to a single FPGA only.…”

Section: Timing Performance (mentioning)

confidence: 99%
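The timing figures in the statement above are related by simple arithmetic: frame rate is the reciprocal of per-image latency, and a speedup divides the baseline latency. The sketch below only replays the quoted numbers; the function names are hypothetical and nothing here comes from the cited papers' code.

```python
# Replays the arithmetic behind the quoted timing figures.
# Function names are illustrative assumptions, not from the cited papers.

def fps_from_latency_ms(latency_ms: float) -> float:
    """Frames per second for a design that needs latency_ms per image."""
    return 1000.0 / latency_ms


def latency_after_speedup(baseline_ms: float, speedup: float) -> float:
    """Per-image latency implied by a speedup factor over a baseline."""
    return baseline_ms / speedup


# Per-image latencies quoted for the two earlier HMAX accelerators.
quoted_latency_ms = {"[10]": 21.81, "[18]": 11.04}
# Speedups the citing paper reports over each of them.
quoted_speedup = {"[10]": 3.14, "[18]": 1.52}

for ref, latency in quoted_latency_ms.items():
    fps = fps_from_latency_ms(latency)
    implied = latency_after_speedup(latency, quoted_speedup[ref])
    print(f"{ref}: {fps:.2f} fps, implied new latency {implied:.2f} ms")
```

The two quoted speedups imply slightly different per-image times for the new design (roughly 6.9 ms and 7.3 ms), so the figures are best read as approximate.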


A Bottom-up Approach for Visual Object Recognition on FPGA based Embedded Multiprocessor Architecture

Chenini

2017

ijacsa


Abstract: This paper presents an object recognition approach for outdoor autonomous systems that identifies the nature of the object of interest when observing an image. Seeking an effective and robust recognition method, the proposed approach uses a novel saliency-based feature detector/descriptor combined with an object classifier to identify the nature of objects in an indoor or an outdoor environment. Bottom-up visual attention computational models are known to need considerable computational power and to incur a high communication cost. A major challenge in this work is to handle such image processing applications, which manage a large amount of information processing, and to work within real-time requirements by improving the processing speed. Based on an approach that designs specific architectures for parallelism, this paper presents a solution for rapid prototyping of saliency-based object recognition applications. In order to meet the computation and communication requirements, the developed pipelined architectures are composed of identical processing modules which can work concurrently with distributed memories and compute in parallel several sequential tasks with a high computational cost. We present hardware implementations with performance results on a Xilinx System-on-Programmable Chip (SoPC) target. The experimental results, including execution times and application speedups as well as requirements in terms of computing resources, show that the proposed homogeneous network of processors is efficient for embedding the proposed image processing application.


“…Much work has already been completed on the design of dedicated hardware for acceleration of MLPs [18]-[23] or other types of NNs, be they Convolutional Neural Networks [5]-[9], Deep Belief Networks [10]-[12], Hierarchical Model and X [13], or more biologically accurate models [14]-[17]. We, however, propose what we believe is the first instance of an NN accelerator architecture that supports the simultaneous execution of multiple NNs.…”

Section: B Neural Network Accelerators (mentioning)

confidence: 99%

“…Such an interface allows NN hardware diversity to expand while, at the same time, encouraging a diverse set of software use cases to be explored. From a hardware perspective, one would like to see the continued exploration of the full spectrum of NN accelerator technologies from dedicated NN digital logic units [5]-[13], [18]-[23] to biologically inspired analog/sub-threshold implementations [14]-[17]. Similarly, from a software perspective, one wants to encourage both explicit and implicit usage models to grow.…”

Section: Introduction (mentioning)

confidence: 99%

Towards General-Purpose Neural Network Computing

Eldridge, Waterland, Seltzer, et al.

2015

2015 International Conference on Parallel Architecture and Compilation (PACT)


Abstract: Machine learning is becoming pervasive; decades of research in neural network computation is now being leveraged to learn patterns in data and perform computations that are difficult to express using standard programming approaches. Recent work has demonstrated that custom hardware accelerators for neural network processing can outperform software implementations in both performance and power consumption. However, there is neither an agreed-upon interface to neural network accelerators nor a consensus on neural network hardware implementations. We present a generic set of software/hardware extensions, X-FILES, that allow for the general-purpose integration of feedforward and feedback neural network computation in applications. The interface is independent of the network type, configuration, and implementation. Using these proposed extensions, we demonstrate and evaluate an example dynamically allocated, multi-context neural network accelerator architecture, DANA. We show that the combination of X-FILES and our hardware prototype, DANA, enables generic support and increased throughput for neural-network-based computation in multi-threaded scenarios. These diverse implementations and usage cases drive fascinating innovation. As diversity increases, however, a gap is developing between these innovations and the state of today's hardware and software.
