DeepSeek Coder
DeepSeek Coder is a family of open-source large language models specifically designed for code generation and related tasks, aiming to provide a powerful and accessible alternative to closed-source models. Research focuses on improving model performance through techniques like Mixture-of-Experts architectures and repeated sampling during inference to boost problem-solving capabilities, even surpassing some closed-source competitors on certain benchmarks. The availability of these open-source models significantly advances code intelligence research and facilitates broader access to advanced code generation tools for both research and commercial applications.
Papers
November 15, 2024
November 11, 2024
October 21, 2024
July 31, 2024
June 17, 2024
May 30, 2024
May 27, 2024
January 25, 2024