
SCAD is a large-scale dataset of Solidity smart contracts intended to support empirical research in smart contract security analysis, EVM bytecode analysis, machine learning for vulnerability detection, and benchmarking of analysis tools. It provides 513,639 Solidity source files sourced from the DISL collection. The files were compiled across 89 compiler versions (0.4.0 to 0.8.23) and analyzed with the Slither static analysis framework (v11.0) using a custom structural metrics detector. This produced 286,638 static analysis reports, 286,638 bytecode artifact sets (creation bytecode, runtime bytecode, and EVM opcodes), and 227,001 compiler error logs. The dataset also includes CSV files containing per-contract structural metrics, issue counts, and a master index of all dataset components, along with all processing scripts needed to reproduce the dataset.
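As a rough illustration of how the master index might be used, the sketch below tallies compilation outcomes from an index-style CSV and checks that the stated report and error-log counts add up to the number of source files. The column names (`contract_id`, `compiler_version`, `status`) and the sample rows are assumptions made for this example, not the dataset's documented schema.

```python
import csv
import io
from collections import Counter

# Hypothetical excerpt of a master index; the real column names may differ.
SAMPLE_INDEX = """contract_id,compiler_version,status
0xaaa,0.4.24,compiled
0xbbb,0.8.23,compiled
0xccc,0.5.17,compile_error
"""


def tally_outcomes(index_file):
    """Count rows per compilation outcome in a master-index-style CSV."""
    reader = csv.DictReader(index_file)
    return Counter(row["status"] for row in reader)


counts = tally_outcomes(io.StringIO(SAMPLE_INDEX))
print(dict(counts))

# Totals stated in the dataset description: the analysis reports and the
# compiler error logs together account for every source file.
REPORTS, ERROR_LOGS, SOURCE_FILES = 286_638, 227_001, 513_639
assert REPORTS + ERROR_LOGS == SOURCE_FILES
```

The stated totals suggest that each source file yields either an analysis report with bytecode artifacts or a compiler error log, so a tally like this could serve as a quick integrity check after downloading the dataset.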
Ethereum, Blockchain, Vulnerability Detection, EVM, Slither, Solidity Compiler, Bytecode
