CircuitNet: An Open-Source Dataset for Machine Learning in VLSI CAD Applications With Improved Domain-Specific Evaluation Metric and Learning Strategies

Zhuomin Chai,Runsheng Wang,Yibo Lin,Ru Huang,Wei Liu,Yuxiang Zhao

doi:10.1109/tcad.2023.3287970

Abstract

The design automation community has been actively exploring machine learning for VLSI CAD. Many studies have explored learning-based techniques for cross-stage prediction tasks in the design flow. Although building machine learning models usually requires a large amount of data, most studies can only generate small internal datasets for validation due to the lack of large public datasets. Such a situation challenges the research in this field and raises potential issues like difficulty in benchmarking and reproducing results, limited research scope on small internal datasets, and high bar for new researchers. Therefore, in this paper, we present an open-source dataset called “CircuitNet” for machine learning tasks in VLSI CAD. The dataset consists of more than 10K samples extracted from versatile runs of commercial design tools based on 6 open-source RISC-V designs which support typical cross-stage prediction tasks, such as routability and IR drop prediction, with extensive benchmarking on recent models. With the dataset prepared, we identify two practical challenges, data imbalance and model transferability, for machine learning application in CAD. To overcome data imbalance, we propose a loss function, biased loss, to give more weight to the minority, leading to 2% congestion reduction in routability driven placement. We test the model transferability from RISC-V designs to ISPD 2015 contest designs in congestion prediction with several transfer learning methods, and further proposed a knowledge distillation based transfer learning framework with up to 20% accuracy improvement. We believe this dataset can open up new opportunities for machine learning in CAD research and beyond.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CircuitNet: An Open-Source Dataset for Machine Learning in VLSI CAD Applications With Improved Domain-Specific Evaluation Metric and Learning Strategies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: Dec 1, 2023
Citations: 13

Similar Papers

Sex-Based Performance Disparities in Machine Learning Algorithms for Cardiac Disease Prediction: Exploratory Study.
Isabel Straw ... Parashkev Nachev
Journal of medical Internet research | VOL. 26
Isabel Straw, et. al.Isabel Straw ... Parashkev Nachev
03 Mar 2023
Journal of medical Internet research | VOL. 26

Building machine learning models without sharing patient data: A simulation-based analysis of distributed learning by ensembling.
Anup Tuladhar ... Nils D Forkert
Journal of Biomedical Informatics | VOL. 106
Anup Tuladhar, et. al.Anup Tuladhar ... Nils D Forkert
23 Apr 2020
Journal of Biomedical Informatics | VOL. 106

A Data-centric AI Framework for Automating Exploratory Data Analysis and Data Quality Tasks
Hima Patel ... Shanmukha Guttula
Journal of Data and Information Quality | VOL. 15
Hima Patel, et. al.Hima Patel ... Shanmukha Guttula
01 Nov 2023
Journal of Data and Information Quality | VOL. 15

Issue of Data Imbalance on Low Birthweight Baby Outcomes Prediction and Associated Risk Factors Identification: Establishment of Benchmarking Key Machine Learning Models With Data Rebalancing Strategies.
Yang Ren ... Ana López-Defede
Journal of Medical Internet Research | VOL. 25
Yang Ren, et. al.Yang Ren ... Ana López-Defede
31 May 2023
Journal of Medical Internet Research | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CircuitNet: An Open-Source Dataset for Machine Learning in VLSI CAD Applications With Improved Domain-Specific Evaluation Metric and Learning Strategies

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems