Code Generation
Code generation research focuses on using large language models (LLMs) to automatically produce functional and secure code from natural language descriptions or other inputs. Current efforts concentrate on improving the accuracy and efficiency of code generation, including developing novel training objectives like horizon-length prediction and employing techniques such as multi-agent frameworks, Monte Carlo Tree Search, and prompt engineering to guide LLMs towards better solutions. This field is significant because it promises to dramatically increase developer productivity and accelerate software development, while also raising important questions about code security and reliability that require further investigation.
Papers
What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Kunhao Zheng, Juliette Decugis, Jonas Gehring, Taco Cohen, Benjamin Negrevergne, Gabriel Synnaeve
Mitigating Gender Bias in Code Large Language Models via Model Editing
Zhanyue Qin, Haochuan Wang, Zecheng Wang, Deyuan Liu, Cunhang Fan, Zhao Lv, Zhiying Tu, Dianhui Chu, Dianbo Sui
A Survey on LLM-based Code Generation for Low-Resource and Domain-Specific Programming Languages
Sathvik Joel, Jie JW Wu, Fatemeh H. Fard
Steering Large Language Models between Code Execution and Textual Reasoning
Yongchao Chen, Harsh Jhamtani, Srinagesh Sharma, Chuchu Fan, Chi Wang
Generating Equivalent Representations of Code By A Self-Reflection Approach
Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin
Showing LLM-Generated Code Selectively Based on Confidence of LLMs
Jia Li, Yuqi Zhu, Yongmin Li, Ge Li, Zhi Jin
Tadashi: Enabling AI-Based Automated Code Generation With Guaranteed Correctness
Emil Vatai, Aleksandr Drozd, Ivan R. Ivanov, Yinghao Ren, Mohamed Wahib
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning
Yifeng Ding, Hantian Ding, Shiqi Wang, Qing Sun, Varun Kumar, Zijian Wang
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
Zhenyu Pan, Rongyu Cao, Yongchang Cao, Yingwei Ma, Binhua Li, Fei Huang, Han Liu, Yongbin Li
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance
Haolin Jin, Zechao Sun, Yiheng Yang, Huaming Chen
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
Yuling Shi, Songsong Wang, Chengcheng Wan, Xiaodong Gu