Gemini 3 Flash: Google's Fast New AI Model is Here

Google just released Gemini 3 Flash, a faster and cheaper version of Gemini 3 Pro. It's now the default model in the Gemini app worldwide.

What is Gemini 3 Flash?

Google describes Gemini 3 Flash as "frontier intelligence built for speed at a fraction of the cost." It matches or comes close to Gemini 3 Pro on many benchmarks while being significantly cheaper than 3 Pro and 3x faster than Gemini 2.5 Pro.

Known Codenames

Before release, Google tested Gemini 3 Flash on LM Arena under these codenames:

  • Lithiumflow — efficiency-optimized variant
  • Ghost Falcon — final testing version
  • Oceanstone — earlier test version
  • Fierce Falcon — Pro GA variant

Key Specs

Feature        Gemini 3 Flash
Release Date   December 17, 2025
Speed          3x faster than 2.5 Pro
Default Model  Yes (Gemini App globally)
Best For       Video analysis, data extraction, visual Q&A

Pricing

Tier    Price
Input   $0.50 per million tokens
Output  $3.00 per million tokens

That is higher than Gemini 2.5 Flash ($0.30 input, $2.50 output per million tokens), in exchange for significantly better performance.
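To see what the price difference means in practice, here is a minimal sketch that estimates the cost of a workload at the list prices quoted above. The dictionary keys are plain labels for this example rather than official API model IDs, and the 10M-input / 2M-output workload is a made-up illustration.

```python
# Per-million-token list prices in USD, as quoted in this article.
# The keys are labels for this sketch, not official API model IDs.
PRICES = {
    "gemini-3-flash": {"input": 0.50, "output": 3.00},
    "gemini-2.5-flash": {"input": 0.30, "output": 2.50},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload at list prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10M input tokens and 2M output tokens.
for model in PRICES:
    print(f"{model}: ${estimate_cost(model, 10_000_000, 2_000_000):.2f}")
# gemini-3-flash: $11.00
# gemini-2.5-flash: $8.00
```

For this hypothetical workload the newer model costs $3.00 more, so whether the upgrade pays off depends on how much the quality gain is worth for your task.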

Benchmarks

Benchmark             Gemini 3 Flash  Gemini 3 Pro
MMMU-Pro              81.2%           81%
Humanity's Last Exam  33.7%           37.5%
GPQA Diamond          90.4%           91.9%

Gemini 3 Flash even edges out Pro on MMMU-Pro (see the table above), as well as on the Toolathlon and MCP Atlas benchmarks.
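The gaps are easier to read as differences. A small sketch, using only the three scores from the table above, that computes Flash minus Pro in percentage points:

```python
# (Gemini 3 Flash, Gemini 3 Pro) scores in percent, copied from the table above.
SCORES = {
    "MMMU-Pro": (81.2, 81.0),
    "Humanity's Last Exam": (33.7, 37.5),
    "GPQA Diamond": (90.4, 91.9),
}


def flash_minus_pro(scores: dict) -> dict:
    """Return the Flash-minus-Pro gap per benchmark, in percentage points."""
    return {name: round(flash - pro, 1) for name, (flash, pro) in scores.items()}


gaps = flash_minus_pro(SCORES)
for name, gap in gaps.items():
    print(f"{name}: {gap:+.1f} points")
# MMMU-Pro: +0.2 points
# Humanity's Last Exam: -3.8 points
# GPQA Diamond: -1.5 points
```

A positive number means Flash scored higher. Of the three benchmarks listed here, Flash leads only on MMMU-Pro, and by a narrow margin.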

Who's Using It

Major companies already on board:

  • JetBrains
  • Figma
  • Cursor
  • Harvey
  • Latitude

How to Access

Consumers:

  • Gemini App (default model)
  • AI Mode in Google Search

Developers:

  • Google AI Studio (preview)
  • Vertex AI
  • Gemini CLI
  • Google Antigravity
  • Android Studio
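For developers, a request to the model might look roughly like the sketch below, which calls the Gemini API's REST `generateContent` endpoint using only the Python standard library. The model ID `gemini-3-flash-preview` is an assumption (this article does not give the identifier), as are the `GEMINI_API_KEY` environment variable name and the helper names `build_request` and `ask`, so check Google AI Studio for the current values before relying on it.

```python
import json
import os
import urllib.request

# Assumption: the preview model ID. Confirm the current name in Google AI Studio.
MODEL_ID = "gemini-3-flash-preview"
API_ROOT = "https://generativelanguage.googleapis.com/v1beta"


def build_request(prompt: str, api_key: str, model: str = MODEL_ID) -> urllib.request.Request:
    """Build (but do not send) a generateContent request for a text prompt."""
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url=f"{API_ROOT}/models/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def ask(prompt: str) -> str:
    """Send the prompt and return the text of the first part of the first candidate."""
    # Assumption: the API key is stored in the GEMINI_API_KEY environment variable.
    request = build_request(prompt, os.environ["GEMINI_API_KEY"])
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    return payload["candidates"][0]["content"]["parts"][0]["text"]
```

With a valid key in the environment, `ask("Summarize this changelog in three bullets.")` would send one request and return the model's reply as a string. Building the request separately from sending it keeps the payload easy to inspect without spending tokens.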

Model Picker Options

In the Gemini app, you'll see:

  • "Fast" — Gemini 3 Flash for quick answers
  • "Thinking" — Gemini 3 Flash for complex problems
  • "Pro" — Gemini 3 Pro for advanced math/code

Gemini 3 Flash is Google's new workhorse model: fast, affordable, and powerful enough for most tasks. It is well suited to bulk processing, real-time apps, and cost-conscious developers.