Large Language Model
Large language models (LLMs) are sophisticated AI systems designed to process and generate human-like text, aiming to improve various natural language processing tasks. Current research focuses on enhancing LLM safety, efficiency (through techniques like quantization and optimized decoding), and fairness, as well as improving their ability to perform complex reasoning and handle diverse instructions. These advancements are significant because they address critical limitations in current LLMs and pave the way for broader applications across diverse fields, including healthcare, legal tech, and autonomous systems.
Papers
Improved Large Language Model Jailbreak Detection via Pretrained Embeddings
Erick Galinkin, Martin Sablotny
Scaling Law for Language Models Training Considering Batch Size
Xian Shuai, Yiding Wang, Yimeng Wu, Xin Jiang, Xiaozhe Ren
Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Weiqiao Shan, Long Meng, Tong Zheng, Yingfeng Luo, Bei Li, junxin Wang, Tong Xiao, Jingbo Zhu
Adapting Large Language Models to Log Analysis with Interpretable Domain Knowledge
Yuhe Ji, Yilun Liu, Feiyu Yao, Minggui He, Shimin Tao, Xiaofeng Zhao, Su Chang, Xinhua Yang, Weibin Meng, Yuming Xie, Boxing Chen, Hao Yang
Understanding the World's Museums through Vision-Language Reasoning
Ada-Astrid Balauca, Sanjana Garai, Stefan Balauca, Rasesh Udayakumar Shetty, Naitik Agrawal, Dhwanil Subhashbhai Shah, Yuqian Fu, Xi Wang, Kristina Toutanova, Danda Pani Paudel, Luc Van Gool
The "LLM World of Words" English free association norms generated by large language models
Katherine Abramski, Riccardo Improta, Giulio Rossetti, Massimo Stella
RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks
Xu Yang, Chenhui Lin, Haotian Liu, Wenchuan Wu
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu, Yixin Ji, Zeyuan Chen, Qiang Wang, Cunxiang Wang, Hong Liu, Baijun Ji, Jia Xu, Zhongyi Liu, Jinjie Gu, Yuan Zhou, Linjian Mo
Do Large Language Models with Reasoning and Acting Meet the Needs of Task-Oriented Dialogue?
Michelle Elizabeth, Morgan Veyret, Miguel Couceiro, Ondrej Dusek, Lina M. Rojas-Barahona
Best Practices for Large Language Models in Radiology
Christian Bluethgen, Dave Van Veen, Cyril Zakka, Katherine Link, Aaron Fanous, Roxana Daneshjou, Thomas Frauenfelder, Curtis Langlotz, Sergios Gatidis, Akshay Chaudhari
FD-LLM: Large Language Model for Fault Diagnosis of Machines
Hamzah A.A.M. Qaid, Bo Zhang, Dan Li, See-Kiong Ng, Wei Li
MiningGPT -- A Domain-Specific Large Language Model for the Mining Industry
Kurukulasooriya Fernando ana Gianluca Demartini
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation
Yi-Chang Chen, Po-Chun Hsu, Chan-Jan Hsu, Da-shan Shiu
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
Shuaijiang Zhao, Tingwei Guo, Bajian Xiang, Tongtang Wan, Qiang Niu, Wei Zou, Xiangang Li
Detecting Memorization in Large Language Models
Eduardo Slonski
Competition Dynamics Shape Algorithmic Phases of In-Context Learning
Core Francisco Park, Ekdeep Singh Lubana, Itamar Pres, Hidenori Tanaka
Linear Probe Penalties Reduce LLM Sycophancy
Henry Papadatos, Rachel Freedman
LLMs as mirrors of societal moral standards: reflection of cultural divergence and agreement across ethical topics
Mijntje Meijer, Hadi Mohammadi, Ayoub Bagheri
Quantifying perturbation impacts for large language models
Paulius Rauba, Qiyao Wei, Mihaela van der Schaar
Large Language Models in Politics and Democracy: A Comprehensive Survey
Goshi Aoki