Official Google Cloud Vertex AI documentation page. Attribution: Google Cloud

4/8/20264 min read

Google Vertex AI & Gemini 3.1: Powerful Models, Fragmented Rollout, Real Trade-Offs

#AI #Google #Gemini #Vertex AI #Development

The current Google picture is more complicated than I wrote before

You were right to call this out. Google's current Vertex AI docs do explicitly list Gemini 3.1 Pro and Gemini 3.1 Flash-Lite, and the featured Gemini section currently highlights 3.1 Pro, 3 Flash, and 3.1 Flash-Lite. So any version of this article that frames Google's lineup as if it stopped at 2.5 is incomplete.

At the same time, Google's own documentation still makes the model story feel split across two layers. The All Google models page highlights the newer 3.x family, while the same page's Generally available Gemini models section still prominently lists Gemini 2.5 Pro and Gemini 2.5 Flash-Lite. The pricing page also shows both worlds at once: Gemini 3.1 Pro Preview and Gemini 3.1 Flash-Lite Preview appear there alongside the more established 2.5 pricing rows.

So what is actually current?

The careful answer is this: Gemini 3.x and 3.1 are current in the live Vertex AI documentation and model catalog. They are not imaginary, outdated, or hidden. But 2.5 is still part of the practical production conversation, because Google's generally available model section and pricing tables still give it a very visible role.

That means developers are dealing with a mixed reality:

Newest featured path: Gemini 3.1 Pro, Gemini 3 Flash, Gemini 3.1 Flash-Lite
Still-prominent GA path: Gemini 2.5 Pro, Gemini 2.5 Flash, Gemini 2.5 Flash-Lite
Pricing and rollout nuance: 3.1 entries show up as preview SKUs on the pricing page, while 2.5 remains deeply embedded in the stable and grounded-prompt pricing guidance

What Google is doing well

Google clearly has a broad and ambitious model stack. The official docs now show a fast-moving Gemini family with distinct model roles for advanced multimodal reasoning, agentic coding, image generation, and lower-latency cost-sensitive work.

That is not the profile of a vendor standing still. If anything, the current docs show Google moving quickly enough that the public documentation, pricing language, and rollout states are overlapping in ways that are easy for outsiders to misread.

Where the frustration still comes from

The complaint I still stand by is not that Google lacks strong models. It is that the developer story is fragmented.

The model catalog is current, but not simple: featured, generally available, and preview labels all matter.
The pricing page is rich, but cognitively heavy: preview SKUs, flex or batch pricing, cached inputs, grounded prompt quotas, and separate rate-limit tiers all add overhead.
The product surfaces still overlap: AI Studio, Gemini API, and Vertex AI each solve slightly different pieces of the same problem.

That is why developers end up feeling confused even when the docs are technically up to date.

The practical guidance now

If you want the newest Google path: look at the featured Gemini 3.x and 3.1 models in the current Vertex AI catalog.

If you want the most clearly production-shaped path: keep an eye on which models Google itself places in the generally available section and how they appear in the live pricing tables.

If you are comparing cost and rollout status: do not assume the newest named model and the most operationally settled model are the same thing.

The corrected bottom line

The real story is not "Google is on 2.5" and it is not "everything is fully 3.1 now." The truth in Google's own docs is more nuanced: Gemini 3.1 is current and featured, while Gemini 2.5 is still highly visible in generally available and pricing-oriented production guidance.

That is exactly why the Google ecosystem feels powerful and messy at the same time.

Cover image attribution: official screenshot captured from Google Cloud Vertex AI documentation on 2026-04-08: Google models | Generative AI on Vertex AI.

Sources used for this correction:

Share this article

Find this interesting? Spread the word.

Back to Articles

4/8/20264 min read

Google Vertex AI & Gemini 3.1: Powerful Models, Fragmented Rollout, Real Trade-Offs

#AI #Google #Gemini #Vertex AI #Development

The current Google picture is more complicated than I wrote before

So what is actually current?

That means developers are dealing with a mixed reality:

Newest featured path: Gemini 3.1 Pro, Gemini 3 Flash, Gemini 3.1 Flash-Lite
Still-prominent GA path: Gemini 2.5 Pro, Gemini 2.5 Flash, Gemini 2.5 Flash-Lite
Pricing and rollout nuance: 3.1 entries show up as preview SKUs on the pricing page, while 2.5 remains deeply embedded in the stable and grounded-prompt pricing guidance

What Google is doing well

Where the frustration still comes from

The complaint I still stand by is not that Google lacks strong models. It is that the developer story is fragmented.

The model catalog is current, but not simple: featured, generally available, and preview labels all matter.
The pricing page is rich, but cognitively heavy: preview SKUs, flex or batch pricing, cached inputs, grounded prompt quotas, and separate rate-limit tiers all add overhead.
The product surfaces still overlap: AI Studio, Gemini API, and Vertex AI each solve slightly different pieces of the same problem.

That is why developers end up feeling confused even when the docs are technically up to date.

The practical guidance now

If you want the newest Google path: look at the featured Gemini 3.x and 3.1 models in the current Vertex AI catalog.

If you want the most clearly production-shaped path: keep an eye on which models Google itself places in the generally available section and how they appear in the live pricing tables.

If you are comparing cost and rollout status: do not assume the newest named model and the most operationally settled model are the same thing.

The corrected bottom line

That is exactly why the Google ecosystem feels powerful and messy at the same time.

Cover image attribution: official screenshot captured from Google Cloud Vertex AI documentation on 2026-04-08: Google models | Generative AI on Vertex AI.

Sources used for this correction:

Share this article

Find this interesting? Spread the word.

Google Vertex AI & Gemini 3.1: Powerful Models, Fragmented Rollout, Real Trade-Offs

The current Google picture is more complicated than I wrote before

So what is actually current?

What Google is doing well

Where the frustration still comes from

The practical guidance now

The corrected bottom line

Share this article

Related Articles

MiniMax M2.7 and the Token Plan: A New Kind of AI Pricing

The Coming AI Wave: Claude Mythos Is Real, but OpenAI's Next Step Is Still Unconfirmed

Visual Reading Online: Reading Made Accessible

Find more articles or get in touch

Google Vertex AI & Gemini 3.1: Powerful Models, Fragmented Rollout, Real Trade-Offs

The current Google picture is more complicated than I wrote before

So what is actually current?

What Google is doing well

Where the frustration still comes from

The practical guidance now

The corrected bottom line

Share this article

Related Articles

MiniMax M2.7 and the Token Plan: A New Kind of AI Pricing

The Coming AI Wave: Claude Mythos Is Real, but OpenAI's Next Step Is Still Unconfirmed

Visual Reading Online: Reading Made Accessible

Find more articles or get in touch