Designing Efficient and High-Performance AI Accelerators With Customized STT-MRAM

Kaniz Mishty,Mehdi Sadi

doi:10.1109/tvlsi.2021.3105958

Abstract

We demonstrate the design of efficient and high-performance artificial intelligence (AI)/deep learning accelerators with customized spin transfer torque (STT)-MRAM (STT-MRAM) and a reconfigurable core. Based on model-driven detailed design space exploration, we present the design methodology of an innovative scratchpad-assisted on-chip STT-MRAM-based buffer system for high-performance accelerators. Using analytically derived expression of memory occupancy time of AI model weights and activation maps, the volatility of STT-MRAM is adjusted with process and temperature variation aware scaling of thermal stability factor to optimize the retention time, energy, read/write latency, and area of STT-MRAM. From the analysis of AI workloads and accelerator implementation in 14-nm technology, we verify the efficacy of our AI accelerator with STT-MRAM (STT-AI). Compared to an SRAM-based implementation, the STT-AI accelerator achieves 75% area and 3% power savings at isoaccuracy. Furthermore, with a relaxed bit error rate and negligible AI accuracy tradeoff, the designed STT-AI Ultra accelerator achieves 75.4% and 3.5% savings in area and power, respectively, over regular SRAM-based accelerators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Designing Efficient and High-Performance AI Accelerators With Customized STT-MRAM

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Lead the way for us

Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems	Publication Date: Oct 1, 2021
Citations: 14

Similar Papers

Low-Power High-Density STT MRAMs on a 3-D Vertical Silicon Nanowire Platform
Shivam Verma ... Brajesh Kumar Kaushik
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 24
Shivam Verma, et. al.Shivam Verma ... Brajesh Kumar Kaushik
01 Apr 2016
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 24

Union Bound Analysis for Spin-Torque Transfer Magnetic Random Access Memory with Channel Quantization
...
-
, et. al. ...
30 Mar 2021
30 Mar 2021

Dependence of Voltage and Size on Write Error Rates in Spin-Transfer Torque Magnetic Random-Access Memory
Janusz J Nowak ... Junghyuk Lee
IEEE Magnetics Letters | VOL. 7
Janusz J Nowak, et. al.Janusz J Nowak ... Junghyuk Lee
01 Jan 2015
IEEE Magnetics Letters | VOL. 7

Layout-aware optimization of stt mrams
S K Gupta ... N N Mojumder
-
S K Gupta, et. al.S K Gupta ... N N Mojumder
01 Mar 2012
01 Mar 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Designing Efficient and High-Performance AI Accelerators With Customized STT-MRAM

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems