Paper ID: 2403.16861
DISL: Fueling Research with A Large Dataset of Solidity Smart Contracts
Gabriele Morello, Mojtaba Eshghie, Sofia Bobadilla, Martin Monperrus
The DISL dataset features a collection of 514,506 unique Solidity files that have been deployed to Ethereum mainnet. It addresses the need for a large and diverse dataset of real-world smart contracts. DISL serves as a resource for developing machine learning systems and for benchmarking software engineering tools designed for smart contracts. By aggregating every verified smart contract from Etherscan up to January 15, 2024, DISL surpasses existing datasets in size and recency.
Submitted: Mar 25, 2024