×

注意!页面内容来自https://api-docs.deepseek.com/quick_start/pricing/,本站不储存任何内容,为了更好的阅读体验进行在线解析,若有广告出现,请及时反馈。若您觉得侵犯了您的利益,请通知我们进行删除,然后访问 原网页

Skip to main content

Models & Pricing

The prices listed below are in units of per 1M tokens. A tokenthe smallest unit of text that the model recognizescan be a worda numberor even a punctuation mark. We will bill based on the total number of input and output tokens by the model.


Model Details

MODELdeepseek-v4-flash(1)deepseek-v4-pro
BASE URL (OpenAI Format)https://api.deepseek.com
BASE URL (Anthropic Format)https://api.deepseek.com/anthropic
MODEL VERSIONDeepSeek-V4-FlashDeepSeek-V4-Pro
THINKING MODESupports both non-thinking and thinking (default) modes
See Thinking Mode for how to switch
CONTEXT LENGTH1M
MAX OUTPUTMAXIMUM: 384K
FEATURESJson Output
Tool Calls
Chat Prefix Completion(Beta)
FIM Completion(Beta)Non-thinking mode onlyNon-thinking mode only
PRICING1M INPUT TOKENS (CACHE HIT)(2)$0.0028$0.003625 (75% off(3))$0.0145
1M INPUT TOKENS (CACHE MISS)$0.14$0.435 (75% off(3))$1.74
1M OUTPUT TOKENS$0.28$0.87 (75% off(3))$3.48

(1) The model names deepseek-chat and deepseek-reasoner will be deprecated in the future. For compatibilitythey correspond to the non-thinking mode and thinking mode of deepseek-v4-flashrespectively.
(2) For all modelsthe input cache hit price has been reduced to 1/10 of the launch price. This price adjustment takes effect from 2026/4/26 12:15 UTC.
(3) The deepseek-v4-pro model is currently offered at a 75% discountextended until 2026/05/31 15:59 UTC.


Deduction Rules

The expense = number of tokens × price. The corresponding fees will be directly deducted from your topped-up balance or granted balancewith a preference for using the granted balance first when both balances are available.

Product prices may vary and DeepSeek reserves the right to adjust them. We recommend topping up based on your actual usage and regularly checking this page for the most recent pricing information.