Open Source Model
Open-source large language models (LLMs) aim to democratize access to powerful AI by providing freely available model weights, code, and sometimes the training data itself. Current research focuses on improving the performance and safety of these models, including developing novel training techniques, exploring efficient compression methods such as pruning and merging, and establishing robust benchmarks for evaluating trustworthiness, bias, and safety. This openness fosters collaboration, accelerates innovation, and eases the limitations of proprietary models, particularly around data privacy and accessibility for researchers and developers across fields.
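As one concrete illustration of the compression methods mentioned above, the minimal sketch below applies L1 magnitude pruning to a toy PyTorch module using torch.nn.utils.prune. The layer sizes and the 30% pruning ratio are arbitrary choices for illustration and are not taken from any of the papers listed in this section.

import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Toy two-layer block standing in for part of an open-weight model (illustrative only).
model = nn.Sequential(
    nn.Linear(512, 2048),
    nn.ReLU(),
    nn.Linear(2048, 512),
)

# Zero out the 30% of weights with the smallest L1 magnitude in each Linear layer.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # bake the zeros into the weight tensor permanently

# Report the resulting overall sparsity.
total = sum(p.numel() for p in model.parameters())
zeros = sum((p == 0).sum().item() for p in model.parameters())
print(f"sparsity: {zeros / total:.1%}")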
Papers
ProFLingo: A Fingerprinting-based Intellectual Property Protection Scheme for Large Language Models
Heng Jin, Chaoyu Zhang, Shanghao Shi, Wenjing Lou, Y. Thomas Hou
Aloe: A Family of Fine-tuned Open Healthcare LLMs
Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés, Dario Garcia-Gasulla