Abstract

The technologically relevant task of feature extraction from data performed in deep-learning systems is routinely accomplished as repeated fast Fourier transforms (FFTs) executed electronically in prevalent domain-specific architectures such as graphics processing units (GPUs). However, electronic systems are limited with respect to power dissipation and delay, due to wire-charging challenges related to interconnect capacitance. Here we present a silicon-photonics-based architecture for convolutional neural networks that harnesses the phase property of light to perform FFTs efficiently by executing the convolution as a multiplication in the Fourier domain. The algorithmic execution time is determined by the time of flight of the signal through this photonic reconfigurable passive FFT 'filter' circuit and is on the order of tens of picoseconds. A sensitivity analysis shows that this optical processor must be thermally phase-stabilized to within a few degrees. Furthermore, we find that for a small sample number, the obtainable number of convolutions per unit time, power, and chip area outperforms GPUs by about two orders of magnitude. Lastly, we show that, conceptually, the optical FFT and convolution-processing performance is directly linked to optoelectronic device-level performance, and that improvements in plasmonics, metamaterials, and nanophotonics are fueling next-generation densely interconnected intelligent photonic circuits with relevance for 5G edge-computing networks, by processing tensor operations optically.
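As a point of reference for the Fourier-domain convolution at the heart of this architecture, the following minimal sketch illustrates the convolution theorem numerically in NumPy. It is an illustration of the underlying mathematics only, not of the photonic hardware or of any code from this work; the signal length, kernel, and variable names are illustrative assumptions.

```python
import numpy as np

# Illustrative only: the convolution theorem states that circular
# convolution in the signal domain equals pointwise multiplication
# in the Fourier domain. The photonic circuit described above
# evaluates the transform stages passively, at the time of flight
# of light; here we merely verify the identity numerically.

rng = np.random.default_rng(0)
N = 16                           # small sample number, as in the text
x = rng.standard_normal(N)       # input signal (e.g., one image row)
k = rng.standard_normal(N)       # convolution kernel

# Direct circular convolution, O(N^2) multiply-accumulates
direct = np.array([sum(x[m] * k[(n - m) % N] for m in range(N))
                   for n in range(N)])

# Fourier-domain convolution, O(N log N): FFT -> multiply -> IFFT
fourier = np.fft.ifft(np.fft.fft(x) * np.fft.fft(k)).real

assert np.allclose(direct, fourier)  # identical up to rounding error
```

The asymptotic advantage (O(N log N) versus O(N^2)) is the same one the passive photonic filter circuit exploits, except that there the transform is evaluated at the time of flight of the optical signal rather than as sequenced electronic arithmetic.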