OpenAI Codex
OpenAI Codex, and its successor models like o1, represent a significant advancement in large language models (LLMs) focused on enhancing reasoning capabilities beyond traditional next-word prediction. Current research emphasizes evaluating these models' performance across diverse complex tasks, including planning, code generation, and medical diagnosis, often comparing them to other LLMs and exploring limitations such as probability sensitivity and hallucination. This research is crucial for understanding the strengths and weaknesses of advanced LLMs, informing their responsible development and deployment in various applications ranging from software engineering to healthcare.
Papers
November 9, 2024
October 29, 2024
October 23, 2024
October 17, 2024
October 12, 2024
October 11, 2024
October 8, 2024
October 2, 2024
September 30, 2024
September 27, 2024
September 24, 2024
September 23, 2024
September 20, 2024
September 19, 2024
September 18, 2024
September 17, 2024
September 15, 2024