Google: Gemma 3 4B introduces multimodality, allowing it to process both vision and language inputs to generate text outputs. This advanced model is designed for complex tasks, boasting a substantial context window of up to 128,000 tokens, enabling deep understanding and extended conversations. It supports over 140 languages, making it a versatile tool for global applications. Key enhancements include improved mathematical reasoning, logical problem-solving, and sophisticated chat capabilities, such as structured outputs and function calling. With a maximum output of 4,000 tokens and streaming support, Gemma 3 4B is efficient for real-time interactions. It's priced at $0.02 per 1M input tokens and $0.07 per 1M output tokens, and is available on a FREE access tier through Multi AI.
✅ Best For
🚀 Capabilities
❌ Limitations
Specifications
| Provider | |
| Context Window | 96,000 tokens |
| Max Output | 4,096 tokens |
| Minimum Plan | Economy |
Pricing
| Input Price | $0.0170 / 1M tokens |
| Output Price | $0.0682 / 1M tokens |
💡 With PRO subscription, cost is reduced by 20%