Z4
Balance

Z.AI: GLM 4.5 Air

by z-ai

GLM-4.5-Air is the lightweight variant of Z.AI's latest flagship model family, purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size, making it efficient for various tasks. This model excels in scenarios requiring quick, responsive AI. It supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. With a context window of 131K tokens and a max output of 4K tokens, GLM-4.5-Air is priced at $0.05/0.22 per 1M tokens (input/output) and is available on the STARTER access tier. It supports functions, code, and streaming capabilities.

Text AIAgent-CentricLightweightMoE
67%Quality
131KContext Window
70%Speed
Category
Economy
API access
Unified context
RAG + Knowledge Base
24/7 Support
Try This ModelCompare models

Best For

Chat
Real-time Interaction
Reasoning

🚀 Capabilities

Long context
Structured output
JSON mode
Functions
Code
Streaming

Limitations

No image generation

Specifications

Providerz-ai
Context Window131,072 tokens
Max Output98,304 tokens
Minimum PlanBalance

Pricing

Input Price$0.1300 / 1M tokens
Output Price$0.8500 / 1M tokens

💡 With PRO subscription, cost is reduced by 20%

Ready to try Z.AI: GLM 4.5 Air?

Get 1,000 tokens free on signup

Start for free