Code Hallucination

Code hallucination refers to the generation by large language models (LLMs) of code that appears plausible but is incorrect, incomplete, or references nonexistent elements, posing a significant risk in software development. Current research focuses on classifying hallucination types (e.g., fabricated API calls, nonexistent package names, or logical errors), developing benchmarks for evaluating LLMs' susceptibility, and exploring mitigation strategies such as grounding generation in documentation or improving model training. Understanding and addressing code hallucinations is crucial for ensuring the reliability and security of software that increasingly relies on AI-assisted code generation.
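As a concrete illustration of the package-name hallucination type and a lightweight mitigation, the sketch below checks whether each dependency named in LLM-generated code actually exists on the package index before it is installed. This is a minimal example under stated assumptions, not a method from any particular paper: the helper names are hypothetical, the example package names merely stand in for possibly hallucinated ones, and it uses only PyPI's public per-project JSON endpoint.

```python
# Minimal sketch: flag dependency names in LLM-generated output that cannot be
# verified on PyPI. Helper names are hypothetical; a production pipeline would
# also pin versions and screen for typosquatting lookalikes.
import urllib.error
import urllib.request


def package_exists_on_pypi(name: str) -> bool:
    """Return True if `name` resolves to a real project on PyPI."""
    url = f"https://pypi.org/pypi/{name}/json"
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.status == 200
    except urllib.error.HTTPError:
        # A 404 response suggests the package name may be hallucinated.
        return False


def audit_requirements(requirements: list[str]) -> list[str]:
    """Return the subset of requirement names that could not be verified."""
    return [pkg for pkg in requirements if not package_exists_on_pypi(pkg)]


if __name__ == "__main__":
    # Example stand-ins for names an LLM might emit, real and otherwise.
    generated = ["requests", "reqests", "numpy", "torch-utils-pro"]
    print("Unverified packages:", audit_requirements(generated))
```

A fuller check would also compare near-miss names against popular packages, since hallucinated names that closely resemble real ones are attractive targets for malicious registration.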

Papers