Semantic Code
Semantic code search aims to retrieve code snippets matching natural language descriptions, bridging the semantic gap between programming languages and human language. Current research focuses on improving search accuracy using techniques like Retrieval Augmented Generation (RAG) with large language models (LLMs), graph neural networks (GNNs) that leverage code structure (e.g., call graphs), and novel training methods incorporating both similar and dissimilar code examples to enhance model learning. These advancements significantly impact software development productivity by facilitating faster and more efficient code reuse and discovery, and are also relevant to tasks like malware analysis and automated machine learning.
Papers
August 5, 2024
June 17, 2024
June 2, 2024
September 1, 2023
May 5, 2023
May 21, 2022
January 27, 2022
January 25, 2022