LongCat-2.0
AvailableMeituan's open-weight flagship: a 1.6-trillion-parameter Mixture-of-Experts model (~48B active per token, dynamically routed between ~33B and ~56B) with a 1M-token context, built for agentic coding. Notable as the largest Chinese model trained — for both pre-training and inference — entirely on a ~50,000-card cluster of domestic Chinese AI chips (Meituan's use of the Huawei Collective Communication Library points to Huawei Ascend hardware), and the first trillion-parameter model Meituan claims completed full-process training on home-grown compute. Vendor-reported software-engineering results: 59.5 on SWE-bench Pro (ahead of GPT-5.5's 58.6), 70.8 on Terminal-Bench 2.1, and 77.3 on SWE-bench Multilingual, with overall quality positioned as comparable to Gemini 3.1 Pro (self-reported, not yet independently verified). Open-sourced under the MIT license with weights on Hugging Face and GitHub; follows LongCat-Flash (560B, Sep 2025) and the multimodal LongCat-Next (Mar 2026).
Specifications
- License
- Open source · MIT
- Weights
- Downloadable
- Architecture
- Mixture-of-Experts
- Parameters
- 1.6T · 48B active
- Context window
- 1M tokens
- Max output
- —
- Knowledge cutoff
- —
- Price (in / out, $/M)
- —
- Modalities
- TextCode
Benchmarks
No benchmark scores recorded yet. Spotted some? Submit a correction.
Vendor-reported figures are claims until independently verified. See methodology.