Q3
Balance

Qwen: Qwen3 8B

by qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, engineered for both rigorous reasoning tasks and efficient dialogue. It features a unique ability to seamlessly switch between a "thinking" mode for math, coding, and logical inference, and a "non-thinking" mode for general conversation, making it highly adaptable. This model is fine-tuned for superior instruction-following, agent integration, and creative writing. It offers robust multilingual support across 100+ languages and dialects, natively handles a 32K token context window, and can extend up to 131K tokens with YaRN scaling. Capabilities include functions, code generation, and streaming. It's best for chat, code, and math tasks. Limitations include no image generation and no internet access. Pricing is $0.05/0.25 per 1M tokens (input/output), available for FREE access.

Text GenerationLanguage ModelMultilingualReasoning
65%Quality
32KContext Window
74%Speed
Category
Economy
API access
Unified context
RAG + Knowledge Base
24/7 Support
Try This ModelCompare models

Best For

Chat
Code Generation
Mathematical Reasoning

🚀 Capabilities

Structured output
JSON mode
Functions
Code Generation
Streaming

Limitations

No Image Generation
No Internet Access

Specifications

Providerqwen
Context Window32,000 tokens
Max Output8,192 tokens
Minimum PlanBalance

Pricing

Input Price$0.0500 / 1M tokens
Output Price$0.4000 / 1M tokens

💡 With PRO subscription, cost is reduced by 20%

Ready to try Qwen: Qwen3 8B?

Get 1,000 tokens free on signup

Start for free