Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

Name: Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives
Keywords: FOS: Computer and information sciences, Computer Science - Cryptography and Security, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Cryptography and Security (cs.CR)

Sihao Hu; Tiansheng Huang; Fatih Ilhan; Selim Furkan Tekin; Ling Liu 0001

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2023

Data sources: arXiv.org e-Print Archive

Orvium

Article

Data sources: Orvium

https://doi.org/10.1109/tps-is...

Article . 2023 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2023

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

DBLP

Article

Data sources: DBLP

DBLP

Conference object

Data sources: DBLP

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Nov 2023Embargo end date: 01 Jan 2023Publisher:IEEEJournal:2023 5th IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA)Funded by:NSF | SHF: Medium: Cross-Stack ..., NSF | EAGER: SaTC-EDU: Privacy ..., NSF | NSF-CSIRO: RAI4IoE: Respo...

Authors: Sihao Hu; Tiansheng Huang; Fatih Ilhan; Selim Furkan Tekin; Ling Liu 0001;

doi: 10.1109/tps-isa58951.2023.00044 , 10.48550/arxiv.2310.01152

arXiv: 2310.01152

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

This paper provides a systematic analysis of the opportunities, challenges, and potential solutions of harnessing Large Language Models (LLMs) such as GPT-4 to dig out vulnerabilities within smart contracts based on our ongoing research. For the task of smart contract vulnerability detection, achieving practical usability hinges on identifying as many true vulnerabilities as possible while minimizing the number of false positives. Nonetheless, our empirical study reveals contradictory yet interesting findings: generating more answers with higher randomness largely boosts the likelihood of producing a correct answer but inevitably leads to a higher number of false positives. To mitigate this tension, we propose an adversarial framework dubbed GPTLens that breaks the conventional one-stage detection into two synergistic stages $-$ generation and discrimination, for progressive detection and refinement, wherein the LLM plays dual roles, i.e., auditor and critic, respectively. The goal of auditor is to yield a broad spectrum of vulnerabilities with the hope of encompassing the correct answer, whereas the goal of critic that evaluates the validity of identified vulnerabilities is to minimize the number of false positives. Experimental results and illustrative examples demonstrate that auditor and critic work together harmoniously to yield pronounced improvements over the conventional one-stage detection. GPTLens is intuitive, strategic, and entirely LLM-driven without relying on specialist expertise in smart contracts, showcasing its methodical generality and potential to detect a broad spectrum of vulnerabilities. Our code is available at: https://github.com/git-disl/GPTLens.

10 pages

Related Organizations

Georgia Institute of Technology
United States
GEORGIA TECH RESEARCH CORPORATION
United States

Keywords

FOS: Computer and information sciences, Computer Science - Cryptography and Security, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Cryptography and Security (cs.CR)

2 Research products, page 1 of 1

GPTLens software on GitHub
IsRelatedTo
mythril software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	8
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

8

Top 10%

Green

bronze

Funded by

NSF| SHF: Medium: Cross-Stack Algorithm-Hardware-Systems Optimization Towards Ubiquitous On-Device 3D Intelligence, NSF| EAGER: SaTC-EDU: Privacy Enhancing Techniques and Innovations for AI-Cybersecurity Cross Training, NSF| NSF-CSIRO: RAI4IoE: Responsible AI for Enabling the Internet of Energy

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

2 Research products, page 1 of 1

GPTLens software on GitHub

mythril software on GitHub