Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing

Pantho, Jubaer Hossain; Bhowmik, Pankaj; Bobda, Christophe

doi:10.3390/s21061955

Cited by 10 publications

(6 citation statements)

References 46 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To bring any of our CNN-based systems to the real world, we will embed it into a programmable logic device. Although it is well-known that CNN-based algorithms are computationally intense and require vast computational resources and dynamic power for computation of convolutional operations, in recent years, some programmable devices have been specifically developed to run these kinds of algorithms in real-time [64]. In this respect, some researchers have already successfully tested a variety of hardware implementation methods for different CNN-based structures, mostly based on field-programmable gate arrays (FPGA) architectures [65,66].…”

Section: Discussionmentioning

confidence: 99%

Detection of Negative Stress through Spectral Features of Electroencephalographic Recordings and a Convolutional Neural Network

Martínez-Rodrigo

García-Martínez

Huerta

et al. 2021

Sensors

View full text Add to dashboard Cite

In recent years, electroencephalographic (EEG) signals have been intensively used in the area of emotion recognition, partcularly in distress identification due to its negative impact on physical and mental health. Traditionally, brain activity has been studied from a frequency perspective by computing the power spectral density of the EEG recordings and extracting features from different frequency sub-bands. However, these features are often individually extracted from single EEG channels, such that each brain region is separately evaluated, even when it has been corroborated that mental processes are based on the coordination of different brain areas working simultaneously. To take advantage of the brain’s behaviour as a synchronized network, in the present work, 2-D and 3-D spectral images constructed from common 32 channel EEG signals are evaluated for the first time to discern between emotional states of calm and distress using a well-known deep-learning algorithm, such as AlexNet. The obtained results revealed a significant improvement in the classification performance regarding previous works, reaching an accuracy about 84%. Moreover, no significant differences between the results provided by the diverse approaches considered to reconstruct 2-D and 3-D spectral maps from the original location of the EEG channels over the scalp were noticed, thus suggesting that these kinds of images preserve original spatial brain information.

show abstract

Section: Discussionmentioning

confidence: 99%

Detection of Negative Stress through Spectral Features of Electroencephalographic Recordings and a Convolutional Neural Network

Martínez-Rodrigo

García-Martínez

Huerta

et al. 2021

Sensors

View full text Add to dashboard Cite

show abstract

“…This behaviour is common for all the platforms and algorithms, except for the ConvDirect on the Xavier, where the scalability is limited by the improper use of cache memories in a multithreaded scenario. 5 Leaving apart this outlying result, the algorithm Focusing on energy efficiency, we observe different trends depending on the selected NVP model, number of threads, and platform. The first observation is that the best energy efficiency is not always obtained by increasing the number of threads.…”

Section: Performance and Energy Efficiency Scalabilitymentioning

confidence: 94%

“…ARM Cortex-M CPUs) or low-power processors (e.g. ARM Cortex-A CPUs), the optimisation of this operator is strongly focused on reducing its energy consumption [5].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Performance–energy trade-offs of deep learning convolution algorithms on ARM processors

et al. 2023

View full text Add to dashboard Cite

In this work, we assess the performance and energy efficiency of high-performance codes for the convolution operator, based on the direct, explicit/implicit lowering and Winograd algorithms used for deep learning (DL) inference on a series of ARM-based processor architectures. Specifically, we evaluate the NVIDIA Denver2 and Carmel processors, as well as the ARM Cortex-A57 and Cortex-A78AE CPUs as part of a recent set of NVIDIA Jetson platforms. The performance–energy evaluation is carried out using the ResNet-50 v1.5 convolutional neural network (CNN) on varying configurations of convolution algorithms, number of threads/cores, and operating frequencies on the tested processor cores. The results demonstrate that the best throughput is obtained on all platforms with the Winograd convolution operator running on all the cores at their highest frequency. However, if the goal is to reduce the energy footprint, there is no rule of thumb for the optimal configuration.

show abstract

“…The errors of each set of training data are summed up. In stochastic gradient methods [70][71][72][73][74][75][76][77], the cost and sum of errors is used to update current model parameters to reduce the distance from the optimal point in the parameter space. The equation of binary cross entropy is shown as follows:…”

Section: Expansion Joint Device Recognitionmentioning

confidence: 99%

Detection and Identification of Expansion Joint Gap of Road Bridges by Machine Learning Using Line-Scan Camera Images

Kim

Cho

et al. 2021

ASI

View full text Add to dashboard Cite

Recently, the lack of expansion joint gaps on highway bridges in Korea has been increasing. In particular, with the increase in the number of days during the summer heatwave, the narrowing of the expansion joint gap causes symptoms such as expansion joint damage and pavement blow-up, which threaten traffic safety and structural safety. Therefore, in this study, we developed a machine vision (M/V)-technique-based inspection system that can monitor the expansion joint gap through image analysis while driving at high speed (100 km/h), replacing the current manual method that uses an inspector to inspect the expansion joint gap. To fix the error factors of image analysis that happened during the trial application, a machine learning method was used to improve the accuracy of measuring the gap between the expansion joint device. As a result, the expansion gap identification accuracy was improved by 27.5%, from 67.5% to 95.0%, and the use of the system reduces the survey time by more than 95%, from an average of approximately 1 h/bridge (existing manual inspection method) to approximately 3 min/bridge. We assume, in the future, maintenance practitioners can contribute to preventive maintenance that prepares countermeasures before problems occur.

show abstract

Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing

Cited by 10 publications

References 46 publications

Detection of Negative Stress through Spectral Features of Electroencephalographic Recordings and a Convolutional Neural Network

Detection of Negative Stress through Spectral Features of Electroencephalographic Recordings and a Convolutional Neural Network

Performance–energy trade-offs of deep learning convolution algorithms on ARM processors

Detection and Identification of Expansion Joint Gap of Road Bridges by Machine Learning Using Line-Scan Camera Images

Contact Info

Product

Resources

About