Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An Analysis of Generative AI Capabilities in Security Testing: Evaluating Static Code Analysis Performance
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Background – In today's technology-driven world, ensuring the security of software systems is paramount due to increasing dependency on these systems across all sectors. Security testing, specifically static code analysis, plays a vital role in detecting vulnerabilities before they are exploited. Traditional static analysis tools, such as SonarQube, often struggle to detect complex vulnerabilities, prompting the exploration of Artificial Intelligence (AI) for enhanced security testing.

Objectives – This thesis aims to evaluate the performance of two Generative AI models, ChatGPT and Gemini, in static code analysis for security testing and compare these AI models with each other and with a traditional static code analysis tool, SonarQube, to determine their effectiveness in detecting software vulnerabilities.

Methods – Method used in this thesis is experimentation which enabled me to gather empirical evidence through a controlled environment with controlled variables. It enabled me to compare the performance of ChatGPT, Gemini & SonarQube, this comparison also helped me in identifying a superior performing model.

Results – Both AI models outperformed SonarQube in vulnerability detection. ChatGPT demonstrated slightly better performance in identifying the specific code responsible for vulnerabilities compared to Gemini.

Conclusions – Through the course of this thesis it has become evident that GenAI models offer solid performance when it comes to static code analysis in vulnerability assessment. They show promise and have presented their case by showcasing their superior performance, that they are very much able to assist or even replace the traditional SAST tools in some scenarios.

Place, publisher, year, edition, pages
2025. , p. 35
Keywords [en]
Security Testing, Static Code Analysis, Vulnerability Detection, Generative Artificial Intelligence
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:bth-27505OAI: oai:DiVA.org:bth-27505DiVA, id: diva2:1941555
Subject / course
PA2534 Master's Thesis (120 credits) in Software Engineering
Educational program
PAASW Master's Programme in Software Engineering 120,0 hp
Supervisors
Examiners
Available from: 2025-03-03 Created: 2025-02-28 Last updated: 2025-03-03Bibliographically approved

Open Access in DiVA

fulltext(3369 kB)330 downloads
File information
File name FULLTEXT01.pdfFile size 3369 kBChecksum SHA-512
1b896801597e265e0a793233d3f2fef7185d1f6b4f0c429eff46a64cd23fea566ea758f608934438770dcb53dbbc76ad2449c0bc79cdc419fc790ee7e0fc7cf1
Type fulltextMimetype application/pdf

By organisation
Department of Software Engineering
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 330 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 210 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf