LLM Releases

Thinking, math, and agents

Last updated Jun 18, 2026

Reasoning model releases

Models positioned for reasoning, long-horizon tool use, math, coding agents, and other workloads where deliberate problem-solving is the headline feature.

83 models · 25 labs · 55 open · 4 recent


Kimi K2.7 Code

Available
Moonshot AI · Frontier · Open weights

Moonshot's open coding-focused agentic model built on K2.6, with native vision/video input, forced thinking mode, and stronger long-horizon software-engineering performance.

MoE · 1T · 262K ctx · Jun 18, 2026

GLM-5.2

Available
Z.ai (Zhipu AI) · Frontier · Open source

Z.ai's latest open flagship for long-horizon coding, agentic engineering, and million-token workflows, adding IndexShare sparse-attention reuse over GLM-5.1.

MoE · 753B · 1M ctx · Jun 17, 2026

MiniMax-M3

Available
MiniMax · Frontier · Open weights

Native multimodal MiniMax model with a one-million-token context, sparse attention, and agentic coding/cowork positioning.

MoE · 428B · 1M ctx · Jun 16, 2026

GPT-5.6

Preview
OpenAI · Frontier · Proprietary

OpenAI's mid-2026 flagship, headlined by an industry-leading 1.5M-token context window and long-horizon agentic tool use.

MoE · Undisc. · 1.5M ctx · Jun 9, 2026

Claude Fable 5

Withdrawn
Anthropic · Frontier · Proprietary

The public, guardrailed sibling of Mythos and Anthropic's most capable widely-released model, built for long-horizon agentic work. Launched June 9, 2026 across the Claude API, AWS, and Microsoft Foundry — then pulled three days later under a US government export-control directive barring access by foreign nationals.

Undisc. · ctx not listed · Jun 9, 2026

Nemotron 3 Ultra 550B-A55B

Available
NVIDIA · Frontier · Open weights

NVIDIA's largest Nemotron 3 open-weight hybrid Mamba-Transformer MoE, tuned for agentic reasoning, coding, planning, and tool calling.

Hybrid · 550B · 1M ctx · Jun 4, 2026

Claude Opus 4.8

Available
Anthropic · Frontier · Proprietary

Anthropic's most capable model, with strengthened agentic and long-running task performance.

Undisc. · 500K ctx · May 28, 2026

MiniMax-M2.7

Available
MiniMax · Frontier · Open weights

Open-weight agentic model from MiniMax focused on real-world software engineering, office tasks, tool use, and self-improving training workflows.

MoE · 229.9B · ctx not listed · May 26, 2026

Gemini 3.5 Pro

Preview
Google DeepMind · Frontier · Proprietary

Announced at Google I/O 2026; emphasizes deep multimodal reasoning over a 2M-token context.

MoE · Undisc. · 2M ctx · May 19, 2026

Qwen3.6-27B

Available
Alibaba (Qwen) · Open source

Dense 27B that punches far above its weight on agentic coding — easy to self-host on a single GPU node.

Dense · 27B · 256K ctx · May 12, 2026

Grok 4.3

Available
xAI · Frontier · Proprietary

xAI's agentic flagship with a 1M-token context and aggressive API pricing.

MoE · Undisc. · 1M ctx · May 6, 2026

Hunyuan-A13B-Instruct

Available
Tencent Hunyuan · Open weights

Tencent Hunyuan open-weight fine-grained MoE model with 80B total parameters and 13B active parameters, optimized for agentic tool use.

MoE · 80B · ctx not listed · Apr 22, 2026

GLM-5.1

Available
Z.ai (Zhipu AI) · Frontier · Open source

Z.ai's agentic-engineering follow-up to GLM-5, with stronger coding performance and better long-horizon tool-use behavior.

MoE · 754B · ctx not listed · Apr 8, 2026

Gemma 4 31B

Available
Google DeepMind · Open source

Google DeepMind's advanced-reasoning open model for personal computers, part of the April 2026 Gemma 4 family.

Dense · 31B · ctx not listed · Apr 2, 2026

Kimi K2.6

Available
Moonshot AI · Frontier · Open weights

Moonshot's open native multimodal agentic model for long-horizon coding, visual interface generation, and autonomous tool orchestration.

MoE · 1T · 256K ctx · Mar 30, 2026

Step-3.5-Flash

Available
StepFun · Open source

StepFun's Apache-licensed sparse MoE model for fast agentic execution, coding, math, browsing, and tool-use workflows.

MoE · 196B · 256K ctx · Mar 14, 2026

Sarvam-105B

Available
Sarvam AI · Open source

Apache-licensed Indian-context MoE from Sarvam AI, optimized for reasoning, coding, agentic tasks, and 22 Indian languages.

MoE · 105B · 128K ctx · Mar 6, 2026

GPT-5.4

Available
OpenAI · Frontier · Proprietary

Workhorse GPT-5 release with a dedicated Thinking mode; widely deployed across ChatGPT and the API.

MoE · Undisc. · 400K ctx · Mar 5, 2026

GLM-5

Available
Z.ai (Zhipu AI) · Frontier · Open source

Z.ai flagship for complex systems engineering and long-horizon agentic tasks, scaling the GLM line to 744B total / 40B active parameters.

MoE · 744B · ctx not listed · Feb 11, 2026

Kimi K2.5

Available
Moonshot AI · Frontier · Open weights

Open multimodal Kimi model that adds native visual agentic intelligence, instant and thinking modes, and agent-swarm workflows on top of the K2 base.

MoE · 1T · 256K ctx · Jan 27, 2026

GLM-4.7

Available
Z.ai (Zhipu AI) · Frontier · Open source

Coding-focused GLM release with improved multilingual agentic coding, terminal tasks, tool use, and interface generation.

MoE · 358B · ctx not listed · Jan 8, 2026

OLMo 3 Think 32B

Available
Allen Institute for AI (Ai2) · Open source

Ai2's fully open thinking model with public weights, code, data, checkpoints, and training details across the OLMo 3 pipeline.

Dense · 32B · ctx not listed · Dec 15, 2025

Nemotron 3 Nano 30B-A3B

Available
NVIDIA · Open weights

Efficient Nemotron 3 MoE checkpoint for agentic reasoning and coding, activating about 3B parameters while supporting 1M-token contexts.

Hybrid · 30B · 1M ctx · Dec 15, 2025

Mistral Large 3

Available
Mistral AI · Frontier · Open weights

Mistral's largest open-weight MoE, aimed at frontier reasoning while remaining self-hostable.

MoE · 675B · 256K ctx · Dec 2, 2025

DeepSeek-V3.2

Available
DeepSeek · Frontier · Open source

Reasoning-first agent model that adds DeepSeek Sparse Attention and thinking directly inside tool-use workflows.

MoE · 685B · 128K ctx · Dec 1, 2025

DeepSeek-V3.2-Speciale

Available
DeepSeek · Frontier · Open source

High-compute reasoning variant of V3.2, positioned for olympiad-level math, programming, and other deep reasoning tasks.

MoE · 685B · 128K ctx · Dec 1, 2025

Kimi K2 Thinking

Available
Moonshot AI · Frontier · Open weights

Open K2 reasoning-agent variant that interleaves step-by-step thinking with tool calls and supports stable 200-300 step tool-use trajectories.

MoE · 1T · 256K ctx · Nov 6, 2025

GLM-4.6

Available
Z.ai (Zhipu AI) · Frontier · Open source

Agentic reasoning and coding upgrade over GLM-4.5, expanding the text context window from 128K to 200K tokens.

MoE · 357B · 200K ctx · Sep 30, 2025

Kimi K2 Instruct 0905

Available
Moonshot AI · Frontier · Open weights

September 2025 K2 update with stronger agentic coding, better frontend generation, and a context window doubled from 128K to 256K tokens.

MoE · 1T · 256K ctx · Sep 5, 2025

DeepSeek-V3.1

Available
DeepSeek · Open source

Hybrid thinking/non-thinking release that upgraded tool calling, long-context training, and agent task performance.

MoE · 671B · 128K ctx · Aug 21, 2025

Seed-OSS-36B-Instruct

Available
ByteDance Seed · Open source

ByteDance Seed's Apache-licensed long-context reasoning and agent model, with controllable thinking budgets and a native 512K context.

Dense · 36B · 512K ctx · Aug 20, 2025

gpt-oss-20b

Available
OpenAI · Open source

Smaller gpt-oss reasoning model optimized for local inference on systems with about 16GB of memory.

MoE · 21B · 128K ctx · Aug 5, 2025

gpt-oss-120b

Available
OpenAI · Open source

OpenAI's larger open-weight reasoning model, a 117B-total / 5.1B-active MoE with 128K context for local and self-hosted deployment.

MoE · 117B · 128K ctx · Aug 5, 2025

GLM-4.5

Available
Z.ai (Zhipu AI) · Frontier · Open source

Open agentic, reasoning, and coding foundation model that marked Z.ai's international rebrand and its MIT-licensed GLM push.

MoE · 355B · 128K ctx · Jul 28, 2025

GLM-4.5-Air

Available
Z.ai (Zhipu AI) · Open source

Compact GLM-4.5 companion with 106B total / 12B active parameters for efficient agentic reasoning and coding.

MoE · 106B · 128K ctx · Jul 28, 2025

EXAONE 4.0 32B

Available
LG AI Research · Open weights

LG AI Research's unified model with non-reasoning and reasoning modes, agentic tool use, and English, Korean, and Spanish support.

Dense · 32B · ctx not listed · Jul 15, 2025

Kimi K2 Instruct

Available
Moonshot AI · Frontier · Open weights

Original open K2 post-trained model: a 1T-parameter MoE optimized for coding, reasoning, and tool-using agentic workflows.

MoE · 1T · 128K ctx · Jul 11, 2025

SmolLM3 3B

Available
Hugging Face · Open source

Hugging Face's fully open 3B multilingual long-context model with optional reasoning mode and 128K context.

Dense · 3B · 128K ctx · Jul 8, 2025

ERNIE-4.5-VL-424B-A47B

Available
Baidu · Open source

Baidu's largest ERNIE 4.5 vision-language MoE, supporting text, image, and video inputs with thinking and non-thinking modes.

MoE · 424B · 128K ctx · Jun 30, 2025

Kimi-VL-A3B-Thinking-2506

Available
Moonshot AI · Open source

Updated MIT-licensed Kimi-VL reasoning model with better multimodal reasoning, video understanding, high-resolution perception, and lower thinking-token use.

MoE · 16B · 128K ctx · Jun 21, 2025

MiniMax-M1-80k

Available
MiniMax · Frontier · Open source

Open Apache-licensed hybrid-attention reasoning model with 456B total / 45.9B active parameters and a native 1M-token context.

Hybrid · 456B · 1M ctx · Jun 16, 2025

Magistral Medium

Available
Mistral AI · Proprietary

Mistral's first dedicated reasoning model family, released in Small open-weight and Medium enterprise/API tiers.

Undisc. · ctx not listed · Jun 10, 2025

Magistral Small

Available
Mistral AI · Open weights

Open-weight 24B reasoning model from Mistral's Magistral family, popular for local reasoning experiments.

Dense · 24B · 40K ctx · Jun 10, 2025

DeepSeek-R1-0528

Available
DeepSeek · Frontier · Open source

Major R1 reasoning update with stronger math, programming, general logic, function calling, and reduced hallucinations.

MoE · 671B · 128K ctx · May 28, 2025

Claude Opus 4

Deprecated
Anthropic · Proprietary

First Claude 4 Opus model, positioned for long-running agentic and coding work before the 4.x point releases.

Undisc. · 200K ctx · May 22, 2025

Seed Thinking v1.5

Available
ByteDance Seed · Proprietary

ByteDance Seed reasoning model focused on long-horizon thinking and problem solving.

Undisc. · ctx not listed · May 22, 2025

Sarvam-M

Available
Sarvam AI · Open weights

Sarvam's medium-scale open model for multilingual Indian-language chat, reasoning, and translation tasks.

Dense · Undisc. · ctx not listed · May 21, 2025

Phi-4 Reasoning

Available
Microsoft · Open weights

Phi-4 reasoning-specialized model family for math, science, and chain-of-thought style tasks.

Dense · 14B · ctx not listed · Apr 30, 2025

Qwen3-235B-A22B

Available
Alibaba (Qwen) · Open source

Largest open Qwen3 MoE, introducing hybrid thinking/non-thinking modes and 119-language coverage.

MoE · 235B · 128K ctx · Apr 28, 2025

OpenAI o3

Available
OpenAI · Proprietary

Reasoning model released alongside o4-mini with tool use, image reasoning, and stronger agentic problem solving.

Undisc. · ctx not listed · Apr 16, 2025

Llama-3.3-Nemotron-Super-49B

Available
NVIDIA · Open weights

Open Llama Nemotron reasoning model from NVIDIA's 2025 Nemotron family.

Dense · 49B · 128K ctx · Apr 2, 2025

DeepSeek-V3-0324

Available
DeepSeek · Open source

Post-R1 V3 update with improved reasoning, front-end coding, Chinese writing, search, and function calling.

MoE · 671B · 128K ctx · Mar 25, 2025

Gemini 2.5 Pro

Deprecated
Google DeepMind · Proprietary

Reasoning-focused Gemini 2.5 model that made thinking a core part of Google's flagship model line.

Undisc. · 1M ctx · Mar 25, 2025

ERNIE X1

Available
Baidu · Proprietary

Baidu's reasoning model released alongside ERNIE 4.5 before the open ERNIE 4.5 weights.

Undisc. · ctx not listed · Mar 16, 2025

Granite 3.2 8B

Available
IBM · Open source

Granite 3.2 update with reasoning controls and multimodal/document-oriented Granite variants.

Dense · 8B · 128K ctx · Feb 26, 2025

Claude 3.7 Sonnet

Retired
Anthropic · Proprietary

Anthropic's first hybrid-reasoning Sonnet. Shut down May 11, 2026 as the 4.x line matured.

Undisc. · 200K ctx · Feb 24, 2025

DeepHermes 3 Llama 3 8B

Available
Nous Research · Open weights

Nous Research's reasoning-oriented Hermes model, trained to combine concise answers with optional deep reasoning traces.

Dense · 8B · 8K ctx · Feb 18, 2025

Grok 3

Deprecated
xAI · Proprietary

xAI's third-generation model family, introduced with stronger reasoning, search, and coding modes.

Undisc. · ctx not listed · Feb 17, 2025

Dolphin 3.0 Llama 3.1 8B

Available
Cognitive Computations · Open weights

Popular local assistant model tuned for coding, math, function calling, and agentic workflows.

Dense · 8B · 128K ctx · Feb 2, 2025

Qwen2.5-VL-72B

Available
Alibaba (Qwen) · Open weights

Vision-language Qwen2.5 model for image, document, video, and agentic visual grounding tasks.

Dense · 72B · 128K ctx · Jan 26, 2025

Doubao-1.5-pro

Available
ByteDance Seed · Proprietary

Doubao 1.5 Pro update positioned for stronger multimodal, reasoning, and agentic work on Volcano Engine.

Undisc. · ctx not listed · Jan 22, 2025

DeepSeek-R1

Available
DeepSeek · Frontier · Open source

Breakout open reasoning model trained with large-scale reinforcement learning and released with weights under MIT.

MoE · 671B · 128K ctx · Jan 20, 2025

Kimi k1.5

Available
Moonshot AI · Proprietary

Moonshot's multimodal reinforcement-learning reasoning model, reported as matching OpenAI o1 on math, coding, and multimodal reasoning.

Undisc. · ctx not listed · Jan 20, 2025

Step-2

Available
StepFun · Proprietary

Second-generation StepFun foundation model line with larger-scale multimodal and reasoning ambitions.

Undisc. · ctx not listed · Dec 23, 2024

Phi-4

Available
Microsoft · Open source

A 14B dense model that rivals far larger ones on math and reasoning, under a permissive MIT license.

Dense · 14B · 16K ctx · Dec 12, 2024

Gemini 2.0 Flash

Deprecated
Google DeepMind · Proprietary

First Gemini 2.0 release, built for native multimodal input/output, tool use, and agentic product integrations.

Undisc. · 1M ctx · Dec 11, 2024

EXAONE 3.5 32B

Available
LG AI Research · Open weights

Open-weight 32B model from the EXAONE 3.5 line for bilingual reasoning, coding, and long-context tasks.

Dense · 32B · 32K ctx · Dec 9, 2024

OpenAI o1

Deprecated
OpenAI · Proprietary

General release of OpenAI's o1 reasoning model with stronger deliberative reasoning and multimodal ChatGPT integration.

Undisc. · ctx not listed · Dec 5, 2024

QwQ-32B-Preview

Available
Alibaba (Qwen) · Open source

Qwen's first public reasoning-preview model, aimed at math, coding, and deliberate problem solving.

Dense · 32B · 32K ctx · Nov 28, 2024

DeepSeek-R1-Lite-Preview

Retired
DeepSeek · Proprietary

Reasoning-preview model exposed in DeepSeek Chat ahead of the open DeepSeek-R1 release.

Undisc. · ctx not listed · Nov 20, 2024

Yi-Lightning

Available
01.AI · Proprietary

01.AI's MoE API model that reached the global top-10 on Chatbot Arena, strong in Chinese, math, and coding.

MoE · Undisc. · ctx not listed · Oct 16, 2024

Qwen2.5-72B

Available
Alibaba (Qwen) · Open weights

Broad Qwen2.5 foundation-model update spanning general, coding, math, and multimodal descendants.

Dense · 72B · 128K ctx · Sep 19, 2024

OpenAI o1-preview

Retired
OpenAI · Proprietary

OpenAI's first public reasoning-model preview, optimized to spend more inference time on hard math, coding, and science tasks.

Undisc. · ctx not listed · Sep 12, 2024

Grok-2

Retired
xAI · Proprietary

Second-generation Grok release with Grok-2 and Grok-2 mini for chat, coding, reasoning, and image-enabled product experiences.

Undisc. · ctx not listed · Aug 13, 2024

Claude 3.5 Sonnet

Retired
Anthropic · Proprietary

Major Sonnet upgrade that became Anthropic's default high-intelligence workhorse for coding, writing, and visual reasoning.

Undisc. · 200K ctx · Jun 20, 2024

Qwen2-72B

Available
Alibaba (Qwen) · Open weights

Qwen2's largest dense model, introducing stronger multilingual support, coding/math gains, and long-context variants.

Dense · 72B · 128K ctx · Jun 7, 2024

Yi-1.5-34B

Available
01.AI · Open weights

Yi 1.5 update with stronger instruction following, coding, math, and multilingual performance.

Dense · 34B · 4K ctx · May 13, 2024

Grok-1.5

Retired
xAI · Proprietary

Grok update with stronger reasoning and a 128K context window.

Undisc. · 128K ctx · Mar 28, 2024

DBRX Instruct

Available
Databricks / MosaicML · Open weights

Databricks' 132B-total / 36B-active open MoE model for code, math, RAG, and enterprise self-hosted workloads.

MoE · 132B · 32K ctx · Mar 27, 2024

Phi-2

Available
Microsoft · Open weights

2.7B-parameter Phi model showing strong reasoning and language understanding at small scale.

Dense · 2.7B · ctx not listed · Dec 12, 2023

ERNIE 4.0

Available
Baidu · Proprietary

Baidu's fourth-generation ERNIE flagship, announced with stronger understanding, generation, reasoning, and memory.

Undisc. · ctx not listed · Oct 17, 2023

PaLM 2

Retired
Google DeepMind · Proprietary

Google's improved multilingual, reasoning, and coding foundation model family introduced at I/O 2023.

Dense · Undisc. · ctx not listed · May 10, 2023

GPT-4

Deprecated
OpenAI · Proprietary

The model that brought reliable multi-step reasoning to the mainstream; size never disclosed.

Undisc. · 8K ctx · Mar 14, 2023