2022
DOI: 10.1002/adma.202107754
|View full text |Cite
|
Sign up to set email alerts
|

A Reconfigurable Two‐WSe2‐Transistor Synaptic Cell for Reinforcement Learning

Abstract: learning, reinforcement learning (RL) is one of machine learning paradigms, which allow the machine to interact with the environment and update the policy according to the negative or positive reward signals. [10,11] The algorithms of RL [10] mainly include artificial neural network (ANN)-based deep Q-learning (DQN) and spiking neural network (SNN)based reward-modulated spike-timingdependent plasticity (R-STDP). Compared with ANN-based DQN that adopts error backpropagation and gradient descent to update the we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

2
47
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 72 publications
(49 citation statements)
references
References 54 publications
(83 reference statements)
2
47
0
Order By: Relevance
“…In order to assess the weight of synaptic renewal, the nonlinearity is extracted. 51 The nonlinearity of n/p-type channels was (1.22, −1.37)/(1.89, −2.63) as shown in Figure 2(e,g), manifesting that our carbon nanotube 2T-based synapse had good linearity. In addition, under the excitation of 50 consecutive same pulses (±7 V amplitude, 6 μs duration), the ON/OFF ratio of n/p-channels reached 55 (22/0.4 μS).…”
mentioning
confidence: 67%
See 2 more Smart Citations
“…In order to assess the weight of synaptic renewal, the nonlinearity is extracted. 51 The nonlinearity of n/p-type channels was (1.22, −1.37)/(1.89, −2.63) as shown in Figure 2(e,g), manifesting that our carbon nanotube 2T-based synapse had good linearity. In addition, under the excitation of 50 consecutive same pulses (±7 V amplitude, 6 μs duration), the ON/OFF ratio of n/p-channels reached 55 (22/0.4 μS).…”
mentioning
confidence: 67%
“…Figure (d,f) describes the weight updating characteristics of the p-channel and n-channel under 50 sequential programming signals of ±7 V amplitude, 6 μs duration. In order to assess the weight of synaptic renewal, the nonlinearity is extracted . The nonlinearity of n/p-type channels was (1.22, −1.37)/(1.89, −2.63) as shown in Figure (e,g), manifesting that our carbon nanotube 2T-based synapse had good linearity.…”
mentioning
confidence: 86%
See 1 more Smart Citation
“…The memory of 2D semiconductors is often combined with computational processing, since the stored state can finely regulate the conductance of the 2D semiconductor, and then act with the input signal to obtain the processed output. This is similar to applications in neuromorphic computing, where there has been a large amount of research linking device behaviour to the mapping of neuro-synaptic behaviour, but for 2D semiconductors is mostly limited to functional demonstrations of single devices [162][163][164] . Considering the variation between multiple devices and achieving integration at the array level is critical to implementing large-scale systems that highlight the advantages of 2D.…”
Section: Functional Array Functionsmentioning
confidence: 97%
“…Long-term potentiation and depression were also achieved in this device (Figure e), and the synaptic weight change varied with the timing difference between incoming pre- and postsynaptic spikes, resulting in spike-timing-dependent plasticity (STDP) that was realized by the carefully designed voltage waveform (Figure f) . Similar FeFETs have also been implemented in artificial neural networks for neuromorphic applications including associative learning, reinforcement learning, and recognition tasks. …”
Section: Applications Of Ferroelectrics-integrated 2d Devicesmentioning
confidence: 99%