Scalable Transformer Accelerator with Variable Systolic Array for Multiple Models in Voice Assistant Applications

Seok-Woo Chang, Dong-Sun Kim

doi:10.3390/electronics13234683

Abstract

The Transformer is a deep learning model that has quickly become fundamental to natural language processing (NLP) and other machine learning tasks. Transformer hardware accelerators are usually designed for specific models, such as Bidirectional Encoder Representations from Transformers (BERT) or vision Transformer models like ViT. In this study, we propose a Scalable Transformer Accelerator Unit (STAU) that efficiently handles the various Transformer models used in voice assistant applications. A design centered on a Variable Systolic Array (VSA), with control and data preprocessing handled by embedded processors, enables matrix operations of varying sizes. In addition, we propose an efficient variable structure and a row-wise data input method for natural language processing, where the word count changes from input to input. The proposed accelerator speeds up text summarization, audio processing, image search, and generative AI as used in voice assistants.
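The abstract does not give implementation details, so the following is only a minimal software sketch of the general idea behind row-wise input to a fixed-size compute array, not the STAU or VSA design itself. It assumes a hypothetical `tile` parameter standing in for the array dimension and a hypothetical function name `tiled_matmul_rowwise`; activations are fed one token row at a time, so the number of rows (the word count) can change without changing the tiling of the weights.

```python
def tiled_matmul_rowwise(x_rows, w, tile=4):
    """Multiply a (tokens x d_in) matrix by a (d_in x d_out) weight matrix.

    Illustrative assumption: `tile` models the side length of a fixed-size
    processing array. Weights are visited in tile x tile blocks, and the
    activation rows are streamed one token at a time.
    """
    d_in = len(w)
    d_out = len(w[0])
    out = []
    for row in x_rows:  # one token per step; the row count may vary per input
        if len(row) != d_in:
            raise ValueError("row length must match the weight matrix height")
        acc = [0] * d_out  # partial sums for this token
        for k0 in range(0, d_in, tile):        # block of input features
            for n0 in range(0, d_out, tile):   # block of output features
                for k in range(k0, min(k0 + tile, d_in)):
                    for n in range(n0, min(n0 + tile, d_out)):
                        acc[n] += row[k] * w[k][n]
        out.append(acc)
    return out


if __name__ == "__main__":
    weights = [[1, 0], [0, 1], [1, 1]]
    # Two-token and three-token inputs reuse the same weight tiling.
    print(tiled_matmul_rowwise([[1, 2, 3], [4, 5, 6]], weights, tile=2))
    print(tiled_matmul_rowwise([[1, 0, 0], [0, 1, 0], [0, 0, 1]], weights, tile=2))
```

Because each token row is processed independently against the same weight blocks, a sentence with more or fewer words only changes how many rows are streamed in, which is the property the row-wise input method in the abstract appears to target.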

Journal: Electronics
Publication Date: Nov 27, 2024
License type: CC BY 4.0

