Premium

Qwen: Qwen3 VL 30B A3B Thinking

Name: Qwen: Qwen3 VL 30B A3B Thinking
Brand: qwen
Price: 130 USD
Rating: 3.8 (1 reviews)

Qwen3-VL-30B-A3B-Thinking is a cutting-edge multimodal AI model designed to seamlessly integrate robust text generation with sophisticated visual understanding across images and videos. The 'Thinking' variant specifically boosts its reasoning capabilities in demanding fields like STEM, mathematics, and other complex problem-solving scenarios. It demonstrates exceptional performance in perceiving real-world and synthetic categories, precise 2D/3D spatial grounding, and comprehensive long-form visual comprehension, consistently achieving competitive results on multimodal benchmarks. This model is particularly well-suited for agentic applications, capably handling multi-image, multi-turn instructions, video timeline alignments, GUI automation, and even visual coding from initial sketches to debugged user interfaces. Its text performance mirrors that of flagship Qwen3 models, making it highly effective for Document AI, OCR, UI assistance, spatial tasks, and advanced agent research. With a context window of 131K tokens and a max output of 4K tokens, it offers extensive processing power. Pricing is competitive at $0.20 per 1M input tokens and $1.00 per 1M output tokens, accessible via the STARTER tier on Multi AI.

MultimodalVisionReasoningAgentic AISTEM

75%Quality

131KContext Window

70%Speed