Abstract

Despite their unprecedented prevalence, foundation models' exponentially growing training costs, dataset sizes, and model capacities hinder the democratization of modern AI technology and demand novel system design solutions. In this paper, we review state-of-the-art (SOTA) challenges and methodologies in scaling AI system-on-chip (SoC) design to harness the power of foundation models. We organize our discussion into four parts. First, we discuss AI SoC architecture design to enable high-performance training of foundation models. Second, we discuss challenges in managing foundation model training with dataflow accelerators. We show that dataflow accelerators, a promising class of architectures that remove execution bottlenecks by overlapping computation and data fetching, pose new challenges for hardware resource mapping and allocation. Third, we discuss challenges in exploiting parallelism across multiple dimensions, e.g., tensor, model, and data parallelism. Partitioning models along the tensor and model dimensions enables large-model training at the cost of distributed, orchestrated gradient synchronization. Last, we discuss electrical and energy design trade-offs for implementing the massive computation and memory units that capture computation and data locality on a dataflow accelerator. The solution to all four aspects lies at the intersection of system-aware machine learning algorithms, dataflow-driven software systems, and scalable hardware design.
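
To make the gradient-synchronization cost mentioned above concrete, the following is a minimal sketch (not taken from the paper) of data-parallel training in JAX: each device computes gradients on its local shard of the batch, and an all-reduce mean performs the distributed synchronization before every replica applies the same update. The loss function, shapes, and learning rate are illustrative assumptions only.

```python
# Sketch of data-parallel training with explicit gradient synchronization.
# Assumptions: a toy linear model, synthetic data, plain SGD; none of these
# come from the paper itself.
import functools
import jax
import jax.numpy as jnp


def loss_fn(params, x, y):
    # Toy linear model standing in for one replica of a much larger network.
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)


@functools.partial(jax.pmap, axis_name="data")
def train_step(params, x, y, lr):
    # Each device differentiates the loss on its local shard of the batch.
    grads = jax.grad(loss_fn)(params, x, y)
    # Gradient synchronization: all-reduce (mean) across the 'data' axis.
    grads = jax.lax.pmean(grads, axis_name="data")
    # Every replica applies the same synchronized gradient.
    return jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)


n_dev = jax.local_device_count()
key = jax.random.PRNGKey(0)
feat = 8

# Replicate parameters across devices; shard the batch along a leading device axis.
params = {"w": jnp.zeros((feat, 1)), "b": jnp.zeros((1,))}
params = jax.tree_util.tree_map(lambda p: jnp.stack([p] * n_dev), params)
x = jax.random.normal(key, (n_dev, 16, feat))
y = jax.random.normal(key, (n_dev, 16, 1))
lr = jnp.full((n_dev,), 1e-2)

params = train_step(params, x, y, lr)
```

Tensor- and model-parallel mappings add further collectives (e.g., all-gathers of activations across shards) on top of this pattern, which is where the orchestration cost discussed in the abstract arises.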
