Ferroelectric Polarized in Transistor Channel Polarity Modulation for Reward-Modulated Spike-Time-Dependent Plasticity Application.

Yanmei Sun,Nian He,Qi Yuan,Yufei Wang,Dianzhong Wen,Yan Dong

doi:10.1021/acs.jpclett.2c03007

Abstract

Reward signals reflect the developmental tendency of reinforcement learning (RL) agents. Reward-modulated spike-time-dependent plasticity (R-STDP) is an efficient and concise information processing feature in RL. However, the physical construction of R-STDP normally demands complex circuit design engineering, resulting in large power consumption and large area. In this work, we studied the role of ferroelectric polarization in the modulation of carbon nanotube transistor channel polarity. Furthermore, we applied a modulating channel method to construct a 2T synaptic component by spin-coating technology. Based on the nonvolatility of ferroelectric polarization, the synaptic component constructed has the characteristics of reconfigurable polarity. One channel could be modulated to n-type and the other to p-type. One modulated channel was used to perform the STDP function when the reward signal arrived, and the other modulated channel was used to perform the anti-STDP function when the punishment signal arrived. Finally, R-STDP learning rules are implemented on hardware. This work provides a strategy for hardware construction of RL.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ferroelectric Polarized in Transistor Channel Polarity Modulation for Reward-Modulated Spike-Time-Dependent Plasticity Application.

Abstract

Talk to us

Similar Papers

More From: The journal of physical chemistry letters

Lead the way for us

Journal: The journal of physical chemistry letters	Publication Date: Oct 20, 2022
Citations: 3

Similar Papers

Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Tom Everitt ... Ramana Kumar
Synthese | VOL. 198
Tom Everitt, et. al.Tom Everitt ... Ramana Kumar
19 May 2021
Synthese | VOL. 198

Deep Reinforcement Learning with Different Rewards for Scheduling in High-Performance Computing Systems
Md Farhadur Reza ... Bo Zhao
-
Md Farhadur Reza, et. al.Md Farhadur Reza ... Bo Zhao
09 Aug 2021
09 Aug 2021

Artificial Intelligence and the Common Sense of Animals.
Murray Shanahan ... Benjamin Beyret
Trends in Cognitive Sciences | VOL. 24
Murray Shanahan, et. al.Murray Shanahan ... Benjamin Beyret
08 Oct 2020
Trends in Cognitive Sciences | VOL. 24

Reinforcement learning from simultaneous human and MDP reward
...
-
, et. al. ...
04 Jun 2012
04 Jun 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ferroelectric Polarized in Transistor Channel Polarity Modulation for Reward-Modulated Spike-Time-Dependent Plasticity Application.

Abstract

Talk to us

Similar Papers

More From: The journal of physical chemistry letters