Qwen-Max is built on the Qwen2.5 architecture and delivers strong inference performance, particularly on complex multi-step tasks. It is a large-scale Mixture-of-Experts (MoE) model pretrained on over 20 trillion tokens and then post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). Its exact parameter count has not been disclosed. Qwen-Max supports function calling and streaming, which makes it suitable for dynamic, interactive AI applications. It has a context window of 32K tokens (32,768) and can generate up to 4K tokens (4,096) of output. Pricing is $1.60 per 1M input tokens and $6.40 per 1M output tokens, and the model is available through our PRO tier. Explore Qwen-Max for your most demanding AI projects.
✅ Best For
🚀 Capabilities
❌ Limitations
Specifications
| Specification | Value |
| --- | --- |
| Provider | qwen |
| Context Window | 32,768 tokens |
| Max Output | 4,096 tokens |
| Minimum Plan | Premium |
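To show how the limits above might be applied before sending a request, here is a minimal sketch of a token budget check. The function name `fits_limits` and the `shared_window` flag are hypothetical. Whether generated tokens count against the 32,768-token context window is not stated here, so the sketch treats that as an assumption you can toggle.

```python
# Limits taken from the specifications table.
CONTEXT_WINDOW = 32_768  # tokens
MAX_OUTPUT = 4_096       # tokens


def fits_limits(prompt_tokens: int, max_output_tokens: int,
                shared_window: bool = True) -> bool:
    """Return True if a request stays within the listed limits.

    shared_window=True ASSUMES that prompt and output tokens share the
    context window; the listing does not confirm this either way.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if max_output_tokens > MAX_OUTPUT:
        return False
    total = prompt_tokens + (max_output_tokens if shared_window else 0)
    return total <= CONTEXT_WINDOW
```

Under the shared-window assumption, a prompt of 28,672 tokens leaves exactly enough room for the full 4,096-token output.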
Pricing
| Item | Price |
| --- | --- |
| Input Price | $1.6000 / 1M tokens |
| Output Price | $6.4000 / 1M tokens |
💡 With a PRO subscription, the cost is reduced by 20%.
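As a rough illustration of the pricing arithmetic, the sketch below estimates the cost of a single request from the listed per-token prices. The function name `estimate_cost` is hypothetical, and the sketch assumes the 20% PRO reduction applies equally to input and output charges, which the listing does not spell out.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("1.60")
OUTPUT_PRICE_PER_M = Decimal("6.40")
# ASSUMPTION: the PRO discount applies uniformly to the whole bill.
PRO_DISCOUNT = Decimal("0.20")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  pro: bool = False) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    cost = (Decimal(input_tokens) * INPUT_PRICE_PER_M
            + Decimal(output_tokens) * OUTPUT_PRICE_PER_M) / TOKENS_PER_M
    if pro:
        cost *= Decimal(1) - PRO_DISCOUNT
    return cost
```

For example, 1M input tokens plus 1M output tokens comes to $8.00 at list price, or $6.40 with the PRO reduction under the assumption above.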