Public LLM API Prices

Gemini 3.5 Flash

Google · Text · Text models

Lightweight model for cost-sensitive chat, basic generation, and batch text tasks.

Updated 2026-05-20

Input

1.5 USD / 1M tokens

Output

9 USD / 1M tokens

Context window

1,000,000 tokens

Max output

Not provided

Capabilities

Chat
Cache

Knowledge cutoff

Not provided

Reasoning model

Yes

Primary unit

1M tokens

Official and future third-party prices are shown in a shared unit.
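As a rough illustration of what a shared unit implies, the sketch below converts a price quoted per some other token count into USD per 1M tokens. The function name `to_primary_unit` and the per-1K example figure are hypothetical and are not taken from this page.

```python
from decimal import Decimal

# Primary unit used on this page: prices are expressed per 1M tokens.
PRIMARY_UNIT_TOKENS = 1_000_000


def to_primary_unit(price_usd: Decimal, per_tokens: int) -> Decimal:
    """Convert a price quoted per `per_tokens` tokens into USD per 1M tokens."""
    if per_tokens <= 0:
        raise ValueError("per_tokens must be positive")
    return price_usd * PRIMARY_UNIT_TOKENS / per_tokens


# Hypothetical provider quote: 0.0015 USD per 1K tokens.
normalized = to_primary_unit(Decimal("0.0015"), 1_000)
print(f"{normalized} USD / 1M tokens")
```

Decimal is used here instead of float so that small per-token prices do not pick up binary rounding noise.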

Evidence rows

1

Each price row keeps source site, observed time, and excerpt.

Quick verify

Verify on Artificial Analysis

Start from this summary, then cross-check official sources or leaderboards.

Use cases

Good for

Low-cost chat
Lightweight text generation
Batch summarization and rewriting

Not ideal for

Complex reasoning workflows
Heavy agent orchestration
Demanding multimodal tasks

Artificial Analysis snapshot

Fields below come from the AA model page for quick cross-checking.

AA Intelligence

55

AA Input price

1.5 USD / 1M tokens

AA Output price

9 USD / 1M tokens

Input modalities

Image / Speech / Text / Video

Output modalities

Text

Pricing details

Structured official and third-party prices with extensible dimensions.

Official pricing

Input: 1.5 USD / 1M tokens
Output: 9 USD / 1M tokens
Cache: 0.15 USD / 1M tokens
Batch: Pending
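To make the listed rates concrete, here is a minimal sketch of a per-request cost estimate using the official input, output, and cache prices above. It assumes that cached tokens are the portion of the input billed at the cache rate instead of the input rate, and that all output tokens are billed at the output rate. The function name `estimate_cost` is hypothetical, and actual billing rules should be checked against the provider's documentation.

```python
from decimal import Decimal

# Official prices from this page, in USD per 1M tokens.
INPUT_PRICE = Decimal("1.5")
OUTPUT_PRICE = Decimal("9")
CACHE_PRICE = Decimal("0.15")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: `cached_tokens` is the part of `input_tokens` that is
    billed at the cache rate rather than the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHE_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / TOKENS_PER_UNIT


# Example: 200k input tokens (150k of them cached) and 10k output tokens.
print(estimate_cost(200_000, 10_000, cached_tokens=150_000))
```

Batch pricing is left out of the sketch because the page lists it as pending.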

Third-party provider pricing

No third-party provider pricing yet
Reserved dimensions: Input / Output / Cache / Batch
Unified unit: 1M tokens

Sources and evidence

Each price line includes a source chain for trust and verification.

  • Field: Input / Output / Input

    Scenario: Text models

    Captured value: 1.5 USD / 1M tokens

    Source domain: artificialanalysis.ai

    Observed at: 2026-05-20

    Excerpt: Artificial Analysis model page for Gemini 3.5 Flash: input price $1.50 per 1M tokens; cached input $0.15 per 1M tokens. --- Artificial Analysis model page for Gemini 3.5 Flash: output price $9.00 per 1M tokens.
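The evidence row above could be represented as a small record, with a quick check that the captured value actually appears among the dollar amounts quoted in the excerpt. This is only a sketch: the class `EvidenceRow`, its attribute names, and the helper `excerpt_mentions_value` are hypothetical and do not describe how this site stores its data.

```python
import re
from dataclasses import dataclass
from datetime import date
from decimal import Decimal


@dataclass(frozen=True)
class EvidenceRow:
    """One price observation plus its source chain (hypothetical schema)."""
    field: str
    scenario: str
    captured_value: Decimal  # USD per 1M tokens
    source_domain: str
    observed_at: date
    excerpt: str


def excerpt_mentions_value(row: EvidenceRow) -> bool:
    """Return True if any dollar amount in the excerpt equals the captured value."""
    quoted = {Decimal(m) for m in re.findall(r"\$(\d+(?:\.\d+)?)", row.excerpt)}
    return row.captured_value in quoted


row = EvidenceRow(
    field="Input / Output / Input",
    scenario="Text models",
    captured_value=Decimal("1.5"),
    source_domain="artificialanalysis.ai",
    observed_at=date(2026, 5, 20),
    excerpt=(
        "Artificial Analysis model page for Gemini 3.5 Flash: input price $1.50 "
        "per 1M tokens; cached input $0.15 per 1M tokens. --- Artificial Analysis "
        "model page for Gemini 3.5 Flash: output price $9.00 per 1M tokens."
    ),
)

print(excerpt_mentions_value(row))
```

A check like this only confirms that the number is quoted somewhere in the excerpt. It does not confirm which price dimension the number belongs to, so the source page is still worth opening.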