Code Generation

Code generation research focuses on using large language models (LLMs) to automatically produce functional and secure code from natural language descriptions or other inputs. Current efforts concentrate on improving the accuracy and efficiency of code generation, including developing novel training objectives like horizon-length prediction and employing techniques such as multi-agent frameworks, Monte Carlo Tree Search, and prompt engineering to guide LLMs towards better solutions. This field is significant because it promises to dramatically increase developer productivity and accelerate software development, while also raising important questions about code security and reliability that require further investigation.

Papers

May 13, 2024

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Chengyue Wu, Yixiao Ge, Qiushan Guo, Jiahao Wang, Zhixuan Liang, Zeyu Lu, Ying Shan, Ping Luo
Code Generation Multi Modal Large Language Model Comprehensive Benchmark Visual Programming

May 10, 2024

Execution-Based Evaluation of Natural Language to Bash and PowerShell for Incident Remediation
Ngoc Phuoc An Vo, Brent Paulovicks, Vadim Sheinin
Code Generation Code Quality Execution Based

May 7, 2024

May 3, 2024

CodeGRAG: Bridging the Gap between Natural Language and Programming Language via Graphical Retrieval Augmented Generation
Kounianhua Du, Jizheng Chen, Renting Rui, Huacan Chai, Lingyue Fu, Wei Xia, Yasheng Wang, Ruiming Tang, Yong Yu, Weinan Zhang
Code Generation Retrieval Augmented Code Generation Task Code Generation Ability Syntax Graph Retrieval Augmented Code Generation

April 30, 2024

April 29, 2024

April 26, 2024

April 25, 2024

AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation
Zhensu Sun, Xiaoning Du, Zhou Yang, Li Li, David Lo
Artificial Intelligence Code Generation Artificial Intelligence Model Grammar Design

April 24, 2024

CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan, Yao Wan, Zhangqian Bi, Zheng Wang, Hongyu Zhang, Pan Zhou, Lichao Sun
Code Generation Real World Code Machine Generated Data Provenance LLM Based Code Generation Bit Watermarking

April 23, 2024

Beyond Code Generation: An Observational Study of ChatGPT Usage in Software Engineering Practice
Ranim Khojah, Mazen Mohamad, Philipp Leitner, Francisco Gomes de Oliveira Neto
Code Generation Chatbot Response Software Engineering LLM Use Observational Study

April 22, 2024

April 20, 2024

Large Language Models as Test Case Generators: Performance Evaluation and Enhancement
Kefan Li, Yuan Yuan
Code Generation Feature Enhancement Performance Evaluation Test Bed

April 17, 2024

Low-Cost Language Models: Survey and Performance Evaluation on Python Code Generation
Jessica López Espejel, Mahaman Sanoussi Yahaya Alassan, Merieme Bouhandi, Walid Dahhane, El Hassane Ettifouri
Language Model Natural Language Processing Timely Survey Code Generation Comprehensive Evaluation Performance Evaluation Many Natural Language Processing State of the Art Datasets

April 15, 2024

The Fault in our Stars: Quality Assessment of Code Generation Benchmarks
Mohammed Latif Siddiq, Simantika Dristi, Joy Saha, Joanna C. S. Santos
Large Language Model Code Generation Complex Prompt Quality Assessment Code Generation Model Code Generation Benchmark

Code Generation

Papers

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Execution-Based Evaluation of Natural Language to Bash and PowerShell for Incident Remediation

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Sketch Then Generate: Providing Incremental User Feedback and Guiding LLM Code Generation through Language-Oriented Code Sketches

CodeGRAG: Bridging the Gap between Natural Language and Programming Language via Graphical Retrieval Augmented Generation

CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification

Constrained Decoding for Secure Code Generation

Performance-Aligned LLMs for Generating Fast Code

PECC: Problem Extraction and Coding Challenges

How secure is AI-generated Code: A Large-Scale Comparison of Large Language Models

On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation

"ChatGPT Is Here to Help, Not to Replace Anybody" -- An Evaluation of Students' Opinions On Integrating ChatGPT In CS Courses

AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation

CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code

Beyond Code Generation: An Observational Study of ChatGPT Usage in Software Engineering Practice

Assessing GPT-4-Vision's Capabilities in UML-Based Code Generation

Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

Large Language Models as Test Case Generators: Performance Evaluation and Enhancement

Low-Cost Language Models: Survey and Performance Evaluation on Python Code Generation

The Fault in our Stars: Quality Assessment of Code Generation Benchmarks