Model Detail

DeepSeek V4 Flash

DeepSeek · Text · Chat completions

Lightweight model for cost-sensitive chat, basic generation, and batch text tasks.

Updated 2026-05-07

View API pricing View official source Compare similar models Verify on Artificial Analysis Browse category

Input

0,14 USD / 1M tokens

Output

0,28 USD / 1M tokens

Context window

1.000.000 tokens

Max output

Not provided

Capabilities

Chat

Knowledge cutoff

Not provided

Reasoning model

Yes

Primary unit

1M tokens

Official and future third-party prices are shown in a shared unit.

Evidence rows

Each price row keeps source site, observed time, and excerpt.

Quick verify

Verify on Artificial Analysis

Start from this summary, then cross-check official sources or leaderboards.

Use cases

Good for

Chat de baixo custoGeração de texto leveResumos e reescritas em lote

Not ideal for

Fluxos de raciocínio complexosOrquestração de agentes pesadaTarefas multimodais exigentes

Artificial Analysis snapshot

Fields below come from the AA model page for quick cross-checking.

AA Intelligence

46,52

AA Input price

Not provided

AA Output price

Not provided

Input modalities

Text

Output modalities

Text

Source domain

artificialanalysis.ai

Pricing details

Structured official and third-party prices with extensible dimensions.

Official pricing

Input0,14 USD / 1M tokens

Output0,28 USD / 1M tokens

CacheNot provided

BatchPending

Third-party provider pricing

No third-party provider pricing yet

Reserved dimensions: Input / Output / Cache / Batch

统一单位：1M tokens

Sources and evidence

Each price line includes a source chain for trust and verification.

FieldEntrada / Saída
ScenarioChat completions
Captured value0,14 USD / 1M tokens
Source domainapi-docs.deepseek.com
Observed at2026-05-07
ExcerptDeepSeek API pricing table — deepseek-v4-flash: 1M INPUT TOKENS (CACHE MISS) --- DeepSeek API pricing table — deepseek-v4-flash: 1M INPUT TOKENS (CACHE HIT) --- DeepSeek API pricing table — deepseek-v4-flash: 1M OUTPUT TOKENS