Large Language Model
Large language models (LLMs) are sophisticated AI systems designed to process and generate human-like text, aiming to improve various natural language processing tasks. Current research focuses on enhancing LLM safety, efficiency (through techniques like quantization and optimized decoding), and fairness, as well as improving their ability to perform complex reasoning and handle diverse instructions. These advancements are significant because they address critical limitations in current LLMs and pave the way for broader applications across diverse fields, including healthcare, legal tech, and autonomous systems.
Papers
ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement
Zhefan Rao, Liya Ji, Yazhou Xing, Runtao Liu, Zhaoyang Liu, Jiaxin Xie, Ziqiao Peng, Yingqing He, Qifeng Chen
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search
Lei Yang, Shaoyang Xu, Deyi Xiong
Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks
Jiacheng Hu, Xiaoxuan Liao, Jia Gao, Zhen Qi, Hongye Zheng, Chihang Wang
Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning
Chirag Nagpal, Subhashini Venugopalan, Jimmy Tobin, Marilyn Ladewig, Katherine Heller, Katrin Tomanek
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice
Cong Jiang, Xiaolei Yang
AgreeMate: Teaching LLMs to Haggle
Ainesh Chatterjee, Samuel Miller, Nithin Parepally
A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs
OpenMind, Shaohong Zhong, Adam Zhou, Boyuan Chen, Homin Luo, Jan Liphardt
Token-Budget-Aware LLM Reasoning
Tingxu Han, Chunrong Fang, Shiyu Zhao, Shiqing Ma, Zhenyu Chen, Zhenting Wang
Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving
Hao Pang, Zhenpo Wang, Guoqiang Li
A Statistical Framework for Ranking LLM-Based Chatbots
Siavash Ameli, Siyuan Zhuang, Ion Stoica, Michael W. Mahoney
ERPA: Efficient RPA Model Integrating OCR and LLMs for Intelligent Document Processing
Osama Abdellaif, Abdelrahman Nader, Ali Hamdi
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight
Xi Ding, Lei Wang
Annotating References to Mythological Entities in French Literature
Thierry Poibeau (Lattice)
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules
Peijin Xie, Lin Sun, Bingquan Liu, Dexin Wang, Xiangzheng Zhang, Chengjie Sun, Jiajia Zhang
GeneSUM: Large Language Model-based Gene Summary Extraction
Zhijian Chen, Chuan Hu, Min Wu, Qingqing Long, Xuezhi Wang, Yuanchun Zhou, Meng Xiao
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment
Binrui Zeng, Bin Ji, Xiaodong Liu, Jie Yu, Shasha Li, Jun Ma, Xiaopeng Li, Shangwen Wang, Xinran Hong
AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation
Hao Wen, Shizuo Tian, Borislav Pavlov, Wenjie Du, Yixuan Li, Ge Chang, Shanhui Zhao, Jiacheng Liu, Yunxin Liu, Ya-Qin Zhang, Yuanchun Li
SlimGPT: Layer-wise Structured Pruning for Large Language Models
Gui Ling, Ziyang Wang, Yuliang Yan, Qingwen Liu
Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine
Yu He Ke, Liyuan Jin, Kabilan Elangovan, Bryan Wen Xi Ong, Chin Yang Oh, Jacqueline Sim, Kenny Wei-Tsen Loh, Chai Rick Soh, Jonathan Ming Hua Cheng, Aaron Kwang Yang Lee, Daniel Shu Wei Ting, Nan Liu, Hairil Rizal Abdullah
Molly: Making Large Language Model Agents Solve Python Problem More Logically
Rui Xiao, Jiong Wang, Lu Han, Na Zong, Han Wu