LLM Releases
← Catalog

DeepSeekMoE 16B

Available
DeepSeekOpen source

Early DeepSeek sparse MoE research model that foreshadowed the later V2/V3 architecture direction.

Specifications

License
Open source · DeepSeek License
Weights
Downloadable
Architecture
Mixture-of-Experts
Parameters
16B · 2.8B active
Context window
4K tokens
Max output
Knowledge cutoff
Price (in / out, $/M)
Modalities
TextCode

Benchmarks

No benchmark scores recorded yet. Spotted some? Submit a correction.

Vendor-reported figures are claims until independently verified. See methodology.