Xwin-LM 13B is a large language model (LLM) that combines supervised fine-tuning, reward modeling, rejection sampling, and reinforcement learning from human feedback in its alignment methodology. The Xwin-LM series, including the 13B variant, is built on the Llama 2 architecture and is aimed at conversational AI and at open research into alignment techniques. Xwin-LM 13B has been evaluated on prominent benchmarks, with reported results against established models.
Model Development and Alignment Approach
The Xwin-LM project focuses on developing and open-sourcing alignment methods for LLMs. Xwin-LM 13B employs supervised fine-tuning (SFT), reward modeling (RM), rejection sampling, and reinforcement learning from human feedback (RLHF) to shape model behavior in dialogue contexts. Notably, the v0.2 release incorporates Proximal Policy Optimization (PPO) in its reinforcement learning stage, which the developers report improves conversational quality and alignment with human preferences.
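Xwin-LM's training code has not been released, but PPO as used in RLHF optimizes the standard clipped surrogate objective. A minimal sketch of that loss in PyTorch, with illustrative function and parameter names rather than Xwin-LM's actual implementation:

```python
import torch

def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    # Probability ratio between the policy being updated and the rollout policy.
    ratio = torch.exp(logp_new - logp_old)
    # Clipped surrogate objective (Schulman et al., 2017): take the more
    # pessimistic of the clipped and unclipped terms to bound policy updates.
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Negated because optimizers minimize; PPO maximizes the surrogate.
    return -torch.minimum(unclipped, clipped).mean()
```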
The model architecture is based on the Llama 2 foundation model, a widely used transformer-based family. The "13B" designation refers to the model's roughly 13 billion parameters; the variant belongs to a wider family that also includes Xwin-LM 7B and Xwin-LM 70B. This design enables scalability and comparison across model sizes within a unified alignment framework.
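Because the checkpoints build on Llama 2, they load with standard Hugging Face transformers tooling. A minimal sketch, assuming the Xwin-LM/Xwin-LM-13B-V0.2 model ID on the Hugging Face Hub (prompt formatting is discussed under Training Methodology below):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Xwin-LM/Xwin-LM-13B-V0.2"  # assumed Hugging Face Hub checkpoint name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"  # device_map needs accelerate
)

inputs = tokenizer("Hello, who are you?", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```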
Performance and Benchmarking
Xwin-LM 13B has undergone comprehensive evaluation using prominent benchmarks for dialogue and foundational natural language understanding tasks. The AlpacaEval benchmark gauges model win-rates versus established baselines such as Text-Davinci-003, ChatGPT, and GPT-4 across a suite of user questions. On this metric, Xwin-LM-13B-v0.2 attained a 93.22% win-rate against Text-Davinci-003, 87.14% against ChatGPT, and 70.36% against GPT-4.
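The win-rate here is the percentage of evaluation prompts on which an automatic judge prefers the candidate model's answer over the baseline's; AlpacaEval typically credits ties as half a win. A toy illustration with made-up judgment data:

```python
def win_rate(judgments):
    # judgments: one "win", "tie", or "loss" per evaluation prompt.
    score = sum(1.0 if j == "win" else 0.5 if j == "tie" else 0.0 for j in judgments)
    return 100.0 * score / len(judgments)

# e.g. 70 wins, 1 tie, 29 losses over 100 head-to-head comparisons -> 70.5%
print(win_rate(["win"] * 70 + ["tie"] + ["loss"] * 29))
```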
On general natural language processing tasks, Xwin-LM-13B shows consistent performance across multiple test suites. According to the Open LLM Leaderboard, the 13B variant records 56.6 on MMLU (5-shot), 61.5 on ARC (25-shot), 43.8 on TruthfulQA (0-shot), and 82.9 on HellaSwag (10-shot), for an average of about 61.2. These results are competitive with models of similar size and reflect the alignment-oriented training regime.
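The cited average is simply the unweighted mean of the four task scores:

```python
scores = {"MMLU": 56.6, "ARC": 61.5, "TruthfulQA": 43.8, "HellaSwag": 82.9}
print(round(sum(scores.values()) / len(scores), 1))  # 61.2
```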
Training Methodology and Data
Xwin-LM-13B is trained with a multi-stage pipeline. Initially, the model is fine-tuned on curated conversations to establish baseline behavior via SFT. Subsequently, a reward model is constructed to score model outputs quantitatively. Rejection sampling and RLHF, in particular the use of PPO in v0.2, are then applied and are reported to raise win-rates over previous versions and baseline competitors. The training strategy is devised to support complex, multi-turn conversations and context-sensitive assistant behavior.
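Rejection sampling in this setting is commonly implemented as best-of-N selection: sample several candidate responses, score them with the reward model, and keep the highest-scoring one for further tuning or serving. A sketch under that assumption; generate and reward_model are hypothetical callables, not Xwin-LM's actual code:

```python
def rejection_sample(prompt, generate, reward_model, n=8):
    """Best-of-N selection: draw n candidates, keep the one the RM scores highest."""
    candidates = [generate(prompt) for _ in range(n)]
    scored = [(reward_model(prompt, c), c) for c in candidates]
    # The top-reward candidate becomes a training target (or the served answer).
    return max(scored, key=lambda sc: sc[0])[1]
```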
While the specific datasets used for alignment and conversation training are not fully disclosed, the methodology parallels established practices that combine open-source instruction datasets and proprietary evaluation protocols. The Vicuna conversation template is adopted for prompt formatting, structuring dialogue between a user and the assistant.
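The Vicuna template prefixes a fixed system preamble and alternates USER/ASSISTANT turns. A sketch of that multi-turn format; the preamble wording below is the common Vicuna default, and the exact string should be taken from the model card:

```python
SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)

def build_prompt(turns):
    # turns: list of (user_message, assistant_reply) pairs; pass None as the
    # final reply to leave the prompt open for the model to complete.
    prompt = SYSTEM
    for user, assistant in turns:
        prompt += f" USER: {user} ASSISTANT:"
        if assistant is not None:
            prompt += f" {assistant}</s>"
    return prompt

print(build_prompt([("Hi!", "Hello."), ("Who are you?", None)]))
```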
Model Releases and Timeline
The developmental timeline for Xwin-LM 13B features distinct version releases reflecting updates to its alignment and training processes. The initial release, v0.1, debuted in September 2023 with a reported 91.76% win-rate on AlpacaEval, placing it among the leading open-source models of its size at the time. The subsequent v0.2 release in October 2023 incorporated updated reward modeling and PPO-based RLHF, with reported gains particularly in direct evaluation against GPT-4.
Xwin-LM 13B is part of a broader suite that includes 7B- and 70B-parameter versions. The Xwin-LM 70B variant reports a 60.61% win-rate against GPT-4 on AlpacaEval, and the Xwin-LM 7B models offer smaller alternatives trained with the same alignment strategy.
Applications, Limitations, and Licensing
Xwin-LM 13B is designed for general conversational AI and natural language assistance, responding to a broad range of user questions. Its multi-turn conversational abilities rely on the Vicuna-based prompt format described above.
Despite its reported capabilities, certain limitations remain. The source code for Xwin-LM has not been publicly released at the time of writing, although future releases are planned to address this. Additionally, the developers indicate ongoing efforts to enhance capabilities in mathematical reasoning and related domains.
Licensing for Xwin-LM models adheres to the Llama 2 License, consistent with the conditions governing the use and distribution of the Llama 2 foundation model.