Z4
Balance

Z.AI: GLM 4.6V

by z-ai

GLM-4.6V is a cutting-edge large multimodal model engineered for exceptional visual understanding and advanced long-context reasoning. It excels at processing diverse inputs, including images, complex documents, and mixed media, making it ideal for intricate analytical tasks. This model boasts a substantial context window of 131K tokens and a max output of 4K tokens, enabling it to handle extensive information. It processes complex page layouts and charts directly as visual inputs and integrates native multimodal function calling, seamlessly connecting perception with downstream tool execution. Additionally, GLM-4.6V supports interleaved image-text generation and UI reconstruction workflows, such as screenshot-to-HTML synthesis and iterative visual editing. Pricing is set at $0.30 per 1M input tokens and $0.90 per 1M output tokens, accessible via the STARTER tier.

multimodalvisionlong contextfunction callinganalysis
70%Quality
131KContext Window
70%Speed
Category
Economy
API access
Unified context
RAG + Knowledge Base
24/7 Support
Try This ModelCompare models

Best For

Analysis
Document Processing
Visual Reasoning

🚀 Capabilities

Vision
Functions
Code
Streaming

Limitations

No image generation

Specifications

Providerz-ai
Context Window131,072 tokens
Max Output4,096 tokens
Minimum PlanBalance

Pricing

Input Price$0.3000 / 1M tokens
Output Price$0.9000 / 1M tokens

💡 With PRO subscription, cost is reduced by 20%

Ready to try Z.AI: GLM 4.6V?

Get 1,000 tokens free on signup

Start for free