Code Generation
Code generation research focuses on using large language models (LLMs) to automatically produce functional and secure code from natural language descriptions or other inputs. Current efforts concentrate on improving the accuracy and efficiency of code generation, including developing novel training objectives like horizon-length prediction and employing techniques such as multi-agent frameworks, Monte Carlo Tree Search, and prompt engineering to guide LLMs towards better solutions. This field is significant because it promises to dramatically increase developer productivity and accelerate software development, while also raising important questions about code security and reliability that require further investigation.
Papers
Task-oriented Prompt Enhancement via Script Generation
Chung-Yu Wang, Alireza DaghighFarsoodeh, Hung Viet Pham
Selection of Prompt Engineering Techniques for Code Generation through Predicting Code Complexity
Chung-Yu Wang, Alireza DaghighFarsoodeh, Hung Viet Pham
MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents
Ming Zhu, Yi Zhou
Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation
Seonghyeon Lee, Suyeon Kim, Joonwon Jang, Heejae Chon, Dongha Lee, Hwanjo Yu
Contextualized Data-Wrangling Code Generation in Computational Notebooks
Junjie Huang, Daya Guo, Chenglong Wang, Jiazhen Gu, Shuai Lu, Jeevana Priya Inala, Cong Yan, Jianfeng Gao, Nan Duan, Michael R. Lyu
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair
Mingjie Liu, Yun-Da Tsai, Wenfei Zhou, Haoxing Ren
PromSec: Prompt Optimization for Secure Generation of Functional Source Code with Large Language Models (LLMs)
Mahmoud Nazzal, Issa Khalil, Abdallah Khreishah, NhatHai Phan
ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement
Oishik Chatterjee, Pooja Aggarwal, Suranjana Samanta, Ting Dai, Prateeti Mohapatra, Debanjana Kar, Ruchi Mahindru, Steve Barbieri, Eugen Postea, Brad Blancett, Arthur De Magalhaes
Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning
Elizabeth Wilson, György Fazekas, Geraint Wiggins
Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code
Khiem Ton, Nhi Nguyen, Mahmoud Nazzal, Abdallah Khreishah, Cristian Borcea, NhatHai Phan, Ruoming Jin, Issa Khalil, Yelong Shen
Reranking Laws for Language Generation: A Communication-Theoretic Perspective
António Farinhas, Haau-Sing Li, André F. T. Martins
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
Wei Shen, Chuheng Zhang