Browse Models
The simplest way to self-host Nous Hermes 13b. Launch a dedicated cloud GPU server running Lab Station OS to download and serve the model using any compatible app or framework.
Download model weights for local inference. Must be used with a compatible app, notebook, or codebase. May run slowly, or not work at all, depending on your system resources, particularly GPU(s) and available VRAM.
Nous Hermes 13b is a fine-tuned version of Llama 13b trained on 300,000 instructions, including GPT-4 synthetic data and domain-specific scientific content. It excels in reasoning tasks and factual accuracy, ranking highly in benchmarks like ARC and Hellaswag. Supports 2000-token context length using Alpaca prompts.
Nous Hermes 13b represents a significant advancement in open-source language models, built upon the foundation of Llama 13b through extensive fine-tuning. The model emerged from a collaborative effort between Nous Research, Teknium, Karan4D, Redmond AI, and other contributors, showcasing the power of community-driven AI development.
The model's architecture leverages the base Llama 13b structure, enhanced through fine-tuning on over 300,000 instructions. Training was conducted on high-performance hardware, specifically an 8x A100 80GB DGX machine, with a sequence length of 2000 over a period exceeding 50 hours.
The training dataset for Nous Hermes 13b was carefully curated from multiple high-quality sources. The primary component consisted of synthetic GPT-4 outputs, including:
Additional specialized training data incorporated Camel-AI's Biology, Physics, Chemistry, and Math Datasets, along with Airoboros' GPT-4 Dataset. This diverse training corpus contributes to the model's broad capabilities across different domains.
Nous Hermes 13b demonstrates exceptional performance across various benchmarks, achieving results comparable to GPT-3.5-turbo. The model particularly excels in several key areas, ranking first in multiple benchmark categories:
It also achieved a notable second-place ranking in Winogrande, according to GPT4all's benchmark list.
Key features of the model include:
The model implements the Alpaca prompt format, supporting two primary structures:
### Instruction:
### Response:
or
### Instruction:
### Input:
### Response:
Practical applications of Nous Hermes 13b have been demonstrated through various implementations, including chatbots and roleplaying systems. The model's capabilities can be explored through the Alpaca Discord Bot and Alpaca Roleplay Discord Bot, which showcase its versatility in real-world applications.
Nous Hermes 13b is released under the GPL license, making it accessible for research and development purposes. The model is available for download through Hugging Face, with ongoing development plans including GGML and GPTQ 4bit quantizations, as well as comprehensive benchmark testing.