2020
DOI: 10.1109/jssc.2019.2939888
|View full text |Cite
|
Sign up to set email alerts
|

A 12.08-TOPS/W All-Digital Time-Domain CNN Engine Using Bi-Directional Memory Delay Lines for Energy Efficient Edge Computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 55 publications
(20 citation statements)
references
References 19 publications
0
20
0
Order By: Relevance
“…In the traditional digital domain of multiplication and addition, large energy consumption of switching capacitors is wasted. Time-domain computing is a method to implement multiplication and addition using the width or delayed superposition of pulses [49][50][51][52]. As shown in Figure 7, a common time-domain calculation consists of two delay lines: reference line and delay line.…”
Section: Cim Architecture Beyond Mixed Solutionmentioning
confidence: 99%
See 2 more Smart Citations
“…In the traditional digital domain of multiplication and addition, large energy consumption of switching capacitors is wasted. Time-domain computing is a method to implement multiplication and addition using the width or delayed superposition of pulses [49][50][51][52]. As shown in Figure 7, a common time-domain calculation consists of two delay lines: reference line and delay line.…”
Section: Cim Architecture Beyond Mixed Solutionmentioning
confidence: 99%
“…Although the time domain is a part of the analog domain, its effective values are only 0 and 1, which can be calculated as a digital domain in nature. Unlike voltage-domain calculations that require large analog-to-digital conversions, the conversion from the time domain to the digital domain can be achieved simply by a counter or a reference line delay detection [49][50][51][52]. The energy-efficient time-domain computing is suitable for low-power AI processors.…”
Section: Cim Architecture Beyond Mixed Solutionmentioning
confidence: 99%
See 1 more Smart Citation
“…TD cores can be implemented using two different architectures, including spatially unrolled (SU) [6], [7], [9] and recursive (REC) [10], [11]. In the SU architecture, inputs and weights are stored in spatially distributed storage elements with a dedicated processing element (PE).…”
Section: Introductionmentioning
confidence: 99%
“…Within the purview of time-based accumulation, pulsewidth [20], [21] and pulse-delay [18], [22], [23] are the two modulation schemes that have been demonstrated onchip. Of these, pulse-(or, event) delay is more promising for MAC applications due to (1) free addition/subtraction in case of delay, and (2) requirement of peripheral pulse re-routing circuitry requirements in the prior.…”
Section: Introductionmentioning
confidence: 99%