Executable Dataset

Executable datasets are collections of program code, paired with their correct and incorrect versions, designed to facilitate research in automated program repair and other areas of software engineering. Current research focuses on leveraging these datasets to train and evaluate machine learning models, particularly deep neural networks, for tasks such as automatically identifying and fixing bugs, improving code generation from natural language descriptions, and creating more robust and accurate semantic parsers for task-oriented dialogue systems. The availability of such datasets significantly advances the field by enabling data-driven approaches to software development, leading to improved code quality and more efficient development processes.

Papers