DeepSeekMoE 16B
Available · DeepSeek · Open source
Early DeepSeek sparse MoE research model that foreshadowed the later V2/V3 architecture direction.
Specifications
- License: Open source · DeepSeek License
- Weights: Downloadable
- Architecture: Mixture-of-Experts
- Parameters: 16B total · 2.8B active
- Context window: 4K tokens
- Max output: —
- Knowledge cutoff: —
- Price (in / out, $/M): —
- Modalities: Text · Code
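As a rough illustration of what the Parameters row implies, the sketch below computes the share of the model's weights used per token. It assumes "active" means parameters activated per token, the usual reading for a sparse Mixture-of-Experts model; the constant and function names are hypothetical and chosen only for this example.

```python
# Figures copied from the Parameters row above, in billions.
TOTAL_PARAMS_B = 16.0
ACTIVE_PARAMS_B = 2.8  # assumed to mean parameters activated per token


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of total parameters that are active per token."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active per token: {fraction:.1%}")
```

Under that assumption, roughly one sixth of the weights take part in any single forward step, which is why per-token compute is closer to that of a much smaller dense model while all 16B parameters still need to be stored.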
Benchmarks
No benchmark scores recorded yet.
Vendor-reported figures are claims until independently verified.