Browse Models
The simplest way to self-host Pygmalion 2 13B. Launch a dedicated cloud GPU server running Lab Station OS to download and serve the model using any compatible app or framework.
Download model weights for local inference. Must be used with a compatible app, notebook, or codebase. May run slowly, or not work at all, depending on your system resources, particularly GPU(s) and available VRAM.
Pygmalion-2 13B is a conversational AI model built on Llama-2 13B, designed for creative writing and role-playing. It uses a three-role token structure (system/user/model) for improved context retention. Trained using the Axolotl framework on the PIPPA dataset, it excels at maintaining character consistency in extended dialogues.
Pygmalion-2 13B represents a significant evolution in conversational AI models, built upon Meta AI's Llama-2 13B architecture. This instruction-tuned language model specializes in conversation, role-playing, and creative writing applications, marking a substantial advancement over its predecessors in the Pygmalion family.
The model's training methodology combines traditional instruction tuning with specialized data sources, including role-playing content, fictional narratives, and conversational exchanges. A key component of the training data is the PIPPA dataset, alongside carefully curated instruction datasets from role-playing forums. The training process, facilitated by the Axolotl framework, employed synthetic instruction generation to enhance the model's capabilities.
A distinctive feature of Pygmalion-2 13B is its sophisticated prompting format, which represents an improvement over the classic Pygmalion model. The system utilizes three specific tokens for different roles in interactions:
<|system|>
: Used for providing background information and context<|user|>
: Designates user input and queries<|model|>
: Marks the model's generated responsesThis structured approach enables complex interactions and maintains coherent conversation history, making it particularly effective for creative and interactive applications.
The model operates under the Llama-2 license, which permits both commercial and non-commercial applications. However, potential users should note several important considerations:
The development of Pygmalion-2 13B was supported by SpicyChat, demonstrating the growing interest in specialized conversational AI models for creative applications.