Preços públicos de API LLM

Models

Model Detail

DeepSeek V4 Flash

DeepSeek · Text · Chat completions

Lightweight model for cost-sensitive chat, basic generation, and batch text tasks.

Updated 2026-05-07

Input

0,14 USD / 1M tokens

Output

0,28 USD / 1M tokens

Context window

1.000.000 tokens

Max output

Not provided

Capabilities

Chat

Knowledge cutoff

Not provided

Reasoning model

Yes

Primary unit

1M tokens

Official and future third-party prices are shown in a shared unit.

Evidence rows

1

Each price row keeps source site, observed time, and excerpt.

Quick verify

Verify on Artificial Analysis

Start from this summary, then cross-check official sources or leaderboards.

Use cases

Good for

Chat de baixo custoGeração de texto leveResumos e reescritas em lote

Not ideal for

Fluxos de raciocínio complexosOrquestração de agentes pesadaTarefas multimodais exigentes

Artificial Analysis snapshot

Fields below come from the AA model page for quick cross-checking.

AA Intelligence

46,52

AA Input price

Not provided

AA Output price

Not provided

Input modalities

Text

Output modalities

Text

Pricing details

Structured official and third-party prices with extensible dimensions.

Official pricing

Input0,14 USD / 1M tokens
Output0,28 USD / 1M tokens
CacheNot provided
BatchPending

Third-party provider pricing

No third-party provider pricing yet
Reserved dimensions: Input / Output / Cache / Batch
统一单位:1M tokens

Sources and evidence

Each price line includes a source chain for trust and verification.

  • FieldEntrada / Saída

    ScenarioChat completions

    Captured value0,14 USD / 1M tokens

    Source domainapi-docs.deepseek.com

    Observed at2026-05-07

    ExcerptDeepSeek API pricing table — deepseek-v4-flash: 1M INPUT TOKENS (CACHE MISS) --- DeepSeek API pricing table — deepseek-v4-flash: 1M INPUT TOKENS (CACHE HIT) --- DeepSeek API pricing table — deepseek-v4-flash: 1M OUTPUT TOKENS