Benchmarking Large Language Models for Automated Verilog RTL Code Generation

Shailja Thakur, Ramesh Karri, Zhenxing Fan, Brendan Dolan-Gavitt, Hammond Pearce, Baleegh Ahmad, Benjamin Tan, Siddharth Garg

doi:10.23919/date56975.2023.10137086

Abstract

Automating hardware design could remove a significant amount of human error from the engineering process. Verilog is a popular hardware description language for modeling and designing digital systems, so generating Verilog code is a critical first step. Emerging large language models (LLMs) are able to write high-quality code in other programming languages. In this paper, we characterize the ability of LLMs to generate useful Verilog. To do so, we fine-tune pre-trained LLMs on Verilog datasets collected from GitHub and Verilog textbooks. We construct an evaluation framework comprising test benches for functional analysis and a flow to test the syntax of Verilog code generated in response to problems of varying difficulty. Our findings show that, across our problem scenarios, fine-tuning results in LLMs that are more capable of producing syntactically correct code (25.9% overall). Further, when analyzing functional correctness, a fine-tuned open-source CodeGen LLM can outperform the state-of-the-art commercial Codex LLM (6.5% overall). We release our training/evaluation scripts and LLM checkpoints as open-source contributions.

Similar Papers


VeriGen: A Large Language Model for Verilog Code Generation
Shailja Thakur ... Siddharth Garg
ACM Transactions on Design Automation of Electronic Systems | VOL. 29
22 Apr 2024

Optimization of traditional methods for determining the similarity of project names and purchases using large language models
Aleksei Aleksandrovich Golikov ... Yuliya Danilova
Litera
01 Apr 2024

Large language models from OpenAI, Google, Meta, X and Co.: The role of "closed" and "open" models in radiology
Sebastian Nowak ... Alois M Sprinkart
Radiologie (Heidelberg, Germany) | VOL. 64
07 Jun 2024

The political preferences of LLMs.
David Rozado
PloS one | VOL. 19
31 Jul 2024
