Chinese Large Language Model
Chinese Large Language Models (LLMs) are rapidly evolving, aiming to match or surpass the capabilities of their English counterparts. Current research emphasizes rigorous evaluation across diverse benchmarks, focusing on areas like moral reasoning, financial domain expertise, and mitigating biases and hallucinations. These efforts are crucial for ensuring the responsible development and deployment of these powerful tools, with implications for various applications ranging from mental health support to professional fields like finance and medicine. The development of comprehensive evaluation platforms and datasets is a key focus, driving improvements in model performance and safety.
Papers
Evaluating the Generation Capabilities of Large Chinese Language Models
Hui Zeng, Jingyuan Xue, Meng Hao, Chen Sun, Bin Ning, Na Zhang
CLEVA: Chinese Language Models EVAluation Platform
Yanyang Li, Jianqiao Zhao, Duo Zheng, Zi-Yuan Hu, Zhi Chen, Xiaohui Su, Yongfeng Huang, Shijia Huang, Dahua Lin, Michael R. Lyu, Liwei Wang