Публичные цены API LLM

Models

Model Detail

gemini-2.5-flash

Google · Text · Chat completions

Lightweight model for cost-sensitive chat, basic generation, and batch text tasks.

Updated 2026-03-29

Input

0,3 USD / 1M tokens

Output

2,5 USD / 1M tokens

Context window

1 000 000 tokens

Max output

Not provided

Capabilities

Chat

Knowledge cutoff

2025-01-01

Reasoning model

No

Primary unit

1M токенов

Official and future third-party prices are shown in a shared unit.

Evidence rows

1

Each price row keeps source site, observed time, and excerpt.

Quick verify

Verify on Artificial Analysis

Start from this summary, then cross-check official sources or leaderboards.

Use cases

Good for

Низкоценетный чатЛегкая генерация текстаПакетное суммирование и переписывание

Not ideal for

Сложные цепочки рассужденийТяжелая оркестрация агентовТребовательные многомодальные задачи

Artificial Analysis snapshot

Fields below come from the AA model page for quick cross-checking.

AA Intelligence

20,56

AA Input price

0,3 USD / 1M tokens

AA Output price

2,5 USD / 1M tokens

Input modalities

Image / Text / Video

Output modalities

Text

Pricing details

Structured official and third-party prices with extensible dimensions.

Official pricing

Input0,3 USD / 1M tokens
Output2,5 USD / 1M tokens
CacheNot provided
BatchPending

Third-party provider pricing

No third-party provider pricing yet
Reserved dimensions: Input / Output / Cache / Batch
统一单位:1M токенов

Sources and evidence

Each price line includes a source chain for trust and verification.

  • FieldВход / Выход

    ScenarioChat completions

    Captured value0,3 USD / 1M tokens

    Source domainai.google.dev

    Observed at2026-03-29

    ExcerptGoogle Gemini devsite: gemini-2.5-flash input (Standard paid tier, first $ in cell) --- Google Gemini devsite: gemini-2.5-flash output (Standard paid tier, first $ in cell)