Lab release history
Last updated Apr 24, 2026
DeepSeek model releases
Chinese lab shipping permissively licensed frontier-class models. This page collects the lab's model releases, lifecycle events, source links, and model metadata in one crawlable record.
18 models
DeepSeek V4-Flash
Preview: Efficient V4 companion model with 284B total / 13B active parameters and the same one-million-token context window as V4-Pro.
DeepSeek V4-Pro
Preview: Preview-series sparse MoE flagship with a one-million-token context window and 1.6T total / 49B active parameters.
DeepSeek-V3.2
Available: Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.
DeepSeek-V3.2-Speciale
Available: High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.
DeepSeek-V3.2-Exp
Preview: Experimental checkpoint that introduced DeepSeek Sparse Attention as an efficiency bridge between V3.1-Terminus and V3.2.
DeepSeek-V3.1-Terminus
Available: Stability update to V3.1 focused on language consistency, code-agent reliability, and search-agent behavior.
DeepSeek-V3.1
Available: Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.
DeepSeek-R1-0528
Available: Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.
DeepSeek-V3-0324
Available: Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.
DeepSeek-R1
Available: Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.
DeepSeek-V3
Available: The 671B/37B-active MoE release that made DeepSeek a central open-model lab before the R1 breakthrough.
DeepSeek-R1-Lite-Preview
Retired: Reasoning-preview model exposed in DeepSeek Chat ahead of the open DeepSeek-R1 release.
DeepSeek-V2.5
Available: Unified DeepSeek V2 generation combining general-chat and coding strengths before the V3 series.
DeepSeek-Coder-V2
Available: Open code-focused MoE built from DeepSeek-V2, expanding programming-language coverage and coding benchmark performance.
DeepSeek-V2
Available: DeepSeek's first major MoE general model with Multi-head Latent Attention and low-cost API positioning.
DeepSeekMoE 16B
Available: Early DeepSeek sparse MoE research model that foreshadowed the later V2/V3 architecture direction.
DeepSeek LLM 67B
Available: First general DeepSeek language model family, with 7B and 67B base/chat checkpoints.
DeepSeek Coder 33B
Available: DeepSeek's first public code-model family, released before the general DeepSeek LLM line.
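
Several entries above quote MoE sizes as "total / active" parameter counts. As a rough illustration of what those pairs imply, the sketch below computes the share of parameters that is active for each model, using only the figures stated in this listing. The names `MODELS` and `active_fraction` are hypothetical helpers for this example, not part of any DeepSeek tooling, and 1.6T is assumed to mean 1600B.

```python
# Parameter counts in billions, copied from the listing above.
# Assumption: "1.6T total" is treated as 1600B.
MODELS = {
    "DeepSeek V4-Pro": (1600, 49),
    "DeepSeek V4-Flash": (284, 13),
    "DeepSeek-V3": (671, 37),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of total parameters that is active (0..1)."""
    if total_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_b / total_b


if __name__ == "__main__":
    for name, (total_b, active_b) in MODELS.items():
        share = active_fraction(total_b, active_b)
        print(f"{name}: {active_b}B of {total_b}B active ({share:.1%})")
```

On these figures, each listed MoE model activates only a single-digit percentage of its total parameters, which is the property the "total / active" notation is meant to convey.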