Paper ID: 2408.11857

Hermes 3 Technical Report

Ryan Teknium, Jeffrey Quesnelle, Chen Guang

Instruct (or "chat") tuned models have become the primary way most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally aligned generalist instruct and tool-use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state-of-the-art performance among open-weight models on several public benchmarks.

Submitted: Aug 15, 2024

Topics

Large Language Model
Technical Report
Generalist Agent
Open Weight Model
Instruct Tuned Model
