Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-parameter sparse Mixture-of-Experts with 13B active parameters per token using 4-of-256 expert routing. This model excels in creative writing, storytelling, role-play, chat scenarios, and real-time voice assistance, often outperforming average reasoning models. It also introduces advanced agentic performance, trained to navigate agent harnesses like OpenCode, Cline, and Kilo Code, and to handle complex toolchains and long, constraint-filled prompts. The architecture natively supports very long context windows up to 512k tokens, with the Preview API currently served at 128k context using 8-bit quantization for practical deployment. Trinity-Large-Preview reflects Arcee’s efficiency-first design philosophy, offering a production-oriented frontier model with open weights and permissive licensing suitable for real-world applications and experimentation. It features capabilities like JSON mode, streaming, functions, long context, and structured output. Access this powerful model for free on Multi AI.
✅ Best For
🚀 Capabilities
Specifications
| Provider | arcee-ai |
| Context Window | 131,000 tokens |
| Minimum Plan | Economy |
Pricing
| Input Price | Free / 1M tokens |
| Output Price | Free / 1M tokens |
💡 With PRO subscription, cost is reduced by 20%