Gemini 3 Flash Preview is Google's high-speed model optimized for efficiency. It is designed for agentic workflows, multi-turn conversations, and coding assistance, delivering reasoning comparable to larger Gemini variants at significantly lower latency. That makes it well suited to interactive development, long-running agent loops, and collaborative coding tasks. Compared with Gemini 2.5 Flash, it offers broad quality improvements in reasoning, multimodal understanding, and reliability.

The model supports a 1M-token context window (1,048,576 tokens) and accepts text, images, audio, video, and PDFs as input; output is text only. Key features include configurable reasoning via thinking levels (minimal, low, medium, high), structured output, tool use (function calling), automatic context caching, and streaming.

Pricing is $0.50 per 1M input tokens and $3.00 per 1M output tokens, which suits users who need strong reasoning and agentic behavior without the cost or latency of full-scale frontier models.
✅ Best For
🚀 Capabilities
Specifications
| Provider | Google |
| Context Window | 1,048,576 tokens |
| Max Output | 65,536 tokens |
| Minimum Plan | Premium |
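The two token limits above can be checked before sending a request. The following is a minimal sketch, not an official client: the function name `fits_limits` is hypothetical, token counts are assumed to come from whatever tokenizer or counting endpoint you already use, and it conservatively assumes that requested output tokens count against the context window (the listing does not say whether they do).

```python
# Limits copied from the Specifications table.
CONTEXT_WINDOW = 1_048_576  # tokens
MAX_OUTPUT = 65_536         # tokens


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption (not confirmed by the listing): prompt and output tokens
    share the same context window, so their sum is checked against it.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = requested_output_tokens <= MAX_OUTPUT
    within_context = prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_context


if __name__ == "__main__":
    print(fits_limits(1_000_000, 40_000))  # prompt plus output fits in the window
    print(fits_limits(10, 70_000))         # exceeds the max output cap
```

If the provider turns out to count output separately from the context window, the second check can be relaxed to `prompt_tokens <= CONTEXT_WINDOW`.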
Pricing
| Input Price | $0.5000 / 1M tokens |
| Output Price | $3.0000 / 1M tokens |
💡 With a PRO subscription, the cost is reduced by 20%.
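To make the pricing concrete, here is a small sketch of the arithmetic. The function name `estimate_cost` is hypothetical, and it assumes the 20% PRO reduction applies uniformly to both input and output prices; it also ignores any savings from context caching, which the listing does not quantify.

```python
from decimal import Decimal

# Prices copied from the Pricing table (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.50")
OUTPUT_PRICE_PER_M = Decimal("3.00")
PRO_DISCOUNT = Decimal("0.20")  # assumed to apply to input and output alike
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, pro: bool = False) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    cost = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M
    if pro:
        cost *= Decimal(1) - PRO_DISCOUNT
    return cost


if __name__ == "__main__":
    # 200k input tokens and 10k output tokens: 0.10 + 0.03 = 0.13 USD
    print(estimate_cost(200_000, 10_000))
    # Same request with the PRO reduction: 0.13 * 0.8 = 0.104 USD
    print(estimate_cost(200_000, 10_000, pro=True))
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many calls.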