Complete Recipe
"Complete Recipe" research focuses on developing efficient and effective methods for training and improving large language models (LLMs) and other machine learning models. Current research emphasizes techniques like programmatic data generation, in-context reinforcement learning, and innovative training strategies (e.g., continued pretraining, loss function modifications) to enhance model performance, particularly in handling long contexts and diverse data. These advancements are significant because they address the high computational cost and data limitations associated with training powerful models, leading to more accessible and efficient AI solutions across various applications.
Papers
August 8, 2023
July 28, 2023
July 14, 2023
June 29, 2023
June 15, 2023
May 30, 2023
May 9, 2023
April 24, 2023
April 10, 2023
March 17, 2023
March 10, 2023
March 3, 2023
February 3, 2023
December 31, 2022
December 9, 2022
October 25, 2022
October 20, 2022
September 26, 2022
September 15, 2022