Self Improvement
Self-improvement in artificial intelligence focuses on developing systems capable of autonomously enhancing their performance and capabilities without extensive human intervention. Current research emphasizes methods enabling large language models (LLMs) to refine their reasoning, problem-solving, and decision-making skills through iterative processes like reinforcement learning, self-supervised learning, and prompt engineering, often incorporating techniques such as Monte Carlo Tree Search and various self-critique mechanisms. This area is significant because it promises more efficient and adaptable AI systems, potentially leading to advancements in diverse fields such as automated diagnosis, scientific discovery, and robotic control.
Papers
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models
Yuda Song, Hanlin Zhang, Carson Eisenach, Sham Kakade, Dean Foster, Udaya Ghai
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
Xueqing Wu, Yuheng Ding, Bingxuan Li, Pan Lu, Da Yin, Kai-Wei Chang, Nanyun Peng
Boundless Socratic Learning with Language Games
Tom Schaul
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Zhiheng Xi, Dingwen Yang, Jixuan Huang, Jiafu Tang, Guanyu Li, Yiwen Ding, Wei He, Boyang Hong, Shihan Do, Wenyu Zhan, Xiao Wang, Rui Zheng, Tao Ji, Xiaowei Shi, Yitao Zhai, Rongxiang Weng, Jingang Wang, Xunliang Cai, Tao Gui, Zuxuan Wu, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Yu-Gang Jiang