A New Approach to Web Application Security: Utilizing GPT Language Models for Source Code Inspection

Zoltán Szabó,Vilmos Bilicki

doi:10.3390/fi15100326

Abstract

Due to the proliferation of large language models (LLMs) and their widespread use in applications such as ChatGPT, there has been a significant increase in interest in AI over the past year. Multiple researchers have raised the question: how will AI be applied and in what areas? Programming, including the generation, interpretation, analysis, and documentation of static program code based on promptsis one of the most promising fields. With the GPT API, we have explored a new aspect of this: static analysis of the source code of front-end applications at the endpoints of the data path. Our focus was the detection of the CWE-653 vulnerability—inadequately isolated sensitive code segments that could lead to unauthorized access or data leakage. This type of vulnerability detection consists of the detection of code segments dealing with sensitive data and the categorization of the isolation and protection levels of those segments that were previously not feasible without human intervention. However, we believed that the interpretive capabilities of GPT models could be explored to create a set of prompts to detect these cases on a file-by-file basis for the applications under study, and the efficiency of the method could pave the way for additional analysis tasks that were previously unavailable for automation. In the introduction to our paper, we characterize in detail the problem space of vulnerability and weakness detection, the challenges of the domain, and the advances that have been achieved in similarly complex areas using GPT or other LLMs. Then, we present our methodology, which includes our classification of sensitive data and protection levels. This is followed by the process of preprocessing, analyzing, and evaluating static code. This was achieved through a series of GPT prompts containing parts of static source code, utilizing few-shot examples and chain-of-thought techniques that detected sensitive code segments and mapped the complex code base into manageable JSON structures.Finally, we present our findings and evaluation of the open source project analysis, comparing the results of the GPT-based pipelines with manual evaluations, highlighting that the field yields a high research value. The results show a vulnerability detection rate for this particular type of model of 88.76%, among others.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Future Internet	Publication Date: Sep 28, 2023
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A New Approach to Web Application Security: Utilizing GPT Language Models for Source Code Inspection

Abstract

Talk to us

Similar Papers

More From: Future Internet

Lead the way for us

Similar Papers

A Comparative Study of Static Code Analysis tools for Vulnerability Detection in C/C++ and JAVA Source Code
Arvinder Kaur ... Ruchikaa Nayyar
Procedia Computer Science | VOL. 171
Arvinder Kaur, et. al.Arvinder Kaur ... Ruchikaa Nayyar
01 Jan 2020
Procedia Computer Science | VOL. 171

XLMR4MD: New Vietnamese dataset and framework for detecting the consistency of description and permission in Android applications using large language models
Qui Ngoc Nguyen ... Kiet Van Nguyen
Computers & Security | VOL. 140
Qui Ngoc Nguyen, et. al.Qui Ngoc Nguyen ... Kiet Van Nguyen
15 Mar 2024
Computers & Security | VOL. 140

Static Code Analysis Tool for Laravel Framework Based Web Application
Ranindya Paramitha ... Yudistira Dwi Wardhana Asnar
-
Ranindya Paramitha, et. al.Ranindya Paramitha ... Yudistira Dwi Wardhana Asnar
03 Nov 2021
03 Nov 2021

Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics
Gábor Antal ... Péter Hegedűs
Technologies | VOL. 9
Gábor Antal, et. al.Gábor Antal ... Péter Hegedűs
30 Dec 2020
Technologies | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Approach to Web Application Security: Utilizing GPT Language Models for Source Code Inspection

Abstract

Talk to us

Similar Papers

More From: Future Internet