LLM Releases.com

Lab release history

Last updated Apr 24, 2026

DeepSeek model releases

Chinese lab shipping permissively licensed frontier-class models. This page collects the lab's model releases, lifecycle events, source links, and model metadata in one crawlable record.

18 Models · 1 Lab · 17 Open · 4 Recent


Most recent in this set

DeepSeek V4-Flash

DeepSeek V4-Pro

DeepSeek-V3.2

DeepSeek-V3.2-Speciale


18 models

DeepSeek V4-Flash

DeepSeek · Open source

Efficient V4 companion model with 284B total / 13B active parameters and the same one-million-token context window as V4-Pro.

MoE · 284B · 1M ctx · Apr 24, 2026

DeepSeek V4-Pro

DeepSeek · Frontier · Open source

Preview-series sparse MoE flagship with a one-million-token context window and 1.6T total / 49B active parameters.

MoE · 1.6T · 1M ctx · Apr 24, 2026

DeepSeek-V3.2

DeepSeek · Frontier · Open source

Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.

MoE · 685B · 128K ctx · Dec 1, 2025

DeepSeek-V3.2-Speciale

DeepSeek · Frontier · Open source

High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.

MoE · 685B · 128K ctx · Dec 1, 2025

DeepSeek-V3.2-Exp

DeepSeek · Open source

Experimental checkpoint that introduced DeepSeek Sparse Attention as an efficiency bridge between V3.1-Terminus and V3.2.

MoE · 685B · 128K ctx · Sep 29, 2025

DeepSeek-V3.1-Terminus

DeepSeek · Open source

Stability update to V3.1 focused on language consistency, code-agent reliability, and search-agent behavior.

MoE · 685B · 128K ctx · Sep 22, 2025

DeepSeek-V3.1

DeepSeek · Open source

Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.

MoE · 671B · 128K ctx · Aug 21, 2025

DeepSeek-R1-0528

DeepSeek · Frontier · Open source

Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.

MoE · 671B · 128K ctx · May 28, 2025

DeepSeek-V3-0324

DeepSeek · Open source

Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.

MoE · 671B · 128K ctx · Mar 25, 2025

DeepSeek-R1

DeepSeek · Frontier · Open source

Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.

MoE · 671B · 128K ctx · Jan 20, 2025

DeepSeek-V3

DeepSeek · Open source

The 671B/37B-active MoE release that made DeepSeek a central open-model lab before the R1 breakthrough.

MoE · 671B · 128K ctx · Dec 26, 2024

DeepSeek-R1-Lite-Preview

DeepSeek · Proprietary

Reasoning-preview model exposed in DeepSeek Chat ahead of the open DeepSeek-R1 release.

Architecture not listed · Parameters undisclosed · Context not listed · Nov 20, 2024

DeepSeek-V2.5

DeepSeek · Open source

Unified DeepSeek V2 generation combining general-chat and coding strengths before the V3 series.

MoE · 236B · 128K ctx · Sep 5, 2024

DeepSeek-Coder-V2

DeepSeek · Open source

Open code-focused MoE built from DeepSeek-V2, expanding programming-language coverage and coding benchmark performance.

MoE · 236B · 128K ctx · Jun 17, 2024

DeepSeek-V2

DeepSeek · Open source

DeepSeek's first major MoE general model with Multi-head Latent Attention and low-cost API positioning.

MoE · 236B · 128K ctx · May 7, 2024

DeepSeekMoE 16B

DeepSeek · Open source

Early DeepSeek sparse MoE research model that foreshadowed the later V2/V3 architecture direction.

MoE · 16B · 4K ctx · Jan 11, 2024

DeepSeek LLM 67B

DeepSeek · Open source

First general DeepSeek language model family, with 7B and 67B base/chat checkpoints.

Dense · 67B · 4K ctx · Nov 29, 2023

DeepSeek Coder 33B

DeepSeek · Open source

DeepSeek's first public code-model family, released before the general DeepSeek LLM line.

Dense · 33B · 16K ctx · Nov 2, 2023
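
Three entries above (V4-Flash, V4-Pro, and V3) state both total and active parameter counts. As a rough illustration of what those pairs imply, the sketch below computes the share of parameters active per token for each. The dictionary and the `active_share` helper are hypothetical names introduced for this example, and 1.6T is taken as 1600B, which assumes the listed figure is not rounded further.

```python
# Total and active parameter counts in billions, copied from the entries above.
# Assumption: "1.6T" is treated as exactly 1600B.
MODELS = {
    "DeepSeek V4-Flash": (284, 13),
    "DeepSeek V4-Pro": (1600, 49),
    "DeepSeek-V3": (671, 37),
}


def active_share(total_b: float, active_b: float) -> float:
    """Return the fraction of a model's parameters that are active per token."""
    return active_b / total_b


for name, (total_b, active_b) in MODELS.items():
    share = active_share(total_b, active_b)
    print(f"{name}: {share:.1%} of parameters active per token")
```

On these figures, each of the three models activates roughly 3 to 6 percent of its parameters per token, which is the sense in which the listing describes them as sparse MoE releases.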