Code Clone

Code clone detection aims to identify duplicated or highly similar code segments within or across software projects. It matters for improving software quality and maintainability and for understanding code reuse patterns. Current research focuses on large language models (LLMs) and graph-based methods, together with contrastive learning and ensemble techniques, to improve the accuracy and efficiency of clone detection, particularly in cross-lingual scenarios. These advances support software engineering tasks such as plagiarism detection, code comprehension, and software maintenance.
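To make the task concrete, here is a minimal sketch of a purely lexical baseline: two snippets are flagged as clones when the Jaccard similarity of their token sets reaches a threshold. This is not one of the LLM, graph-based, or contrastive approaches mentioned above; the function names (`tokens`, `jaccard`, `is_clone`), the tokenizing regex, and the 0.8 threshold are all illustrative assumptions.

```python
import re


def tokens(code):
    """Return the set of identifiers, integer literals, and single
    non-whitespace characters found in a source string."""
    return set(re.findall(r"[A-Za-z_]\w*|\d+|\S", code))


def jaccard(a, b):
    """Jaccard similarity of the token sets of two snippets, in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0  # two empty snippets are treated as identical
    return len(ta & tb) / len(ta | tb)


def is_clone(a, b, threshold=0.8):
    """Flag a pair as a clone when similarity reaches the threshold.
    The 0.8 default is an arbitrary illustrative value."""
    return jaccard(a, b) >= threshold


if __name__ == "__main__":
    original = "def add(x, y):\n    return x + y"
    reformatted = "def add(x,y): return x+y"
    unrelated = "print('hello')"
    print(is_clone(original, reformatted))
    print(is_clone(original, unrelated))
```

A baseline like this only catches clones that differ in layout or whitespace. Pairs that rename identifiers, reorder statements, or implement the same behavior in another language share few tokens, which is the gap that learned representations are intended to close.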

Papers