Abstract

AbstractMost existing deep learning-based source code vulnerability detection methods focus on the design of different deep learning algorithms to improve the accuracy of source code vulnerability detection, ignoring the obvious problems: firstly, lack of sufficient source code vulnerability data; secondly, lack of high-quality code data for deep learning algorithms. Therefore, we propose a code attribute graph-based data generation method for deep learning based source code vulnerability detection. The proposed method tries to represent source code based on code attribute graph, which is used to extract the control flow information and data flow information of source code sequentially. And the code data can be generated by retrieval-matching method. The advantage of this method is that it can extract rich semantic information of source code, and the generated code slices can be used for deep learning algorithms directly. Experimental results show that the proposed method can generate a large number of high-quality code data, which can provide data for deep learning-based source code vulnerability detection.KeywordsVulnerability dataset generationCode property graph/Code attribute graphData flowControl flowSource code vulnerability detectionDeep learning

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.