Which LLM is the best in 2026?

It depends on the task. For coding, Claude Fable 5 leads with 95.0% on SWE-bench Verified, ahead of Claude Opus 4.8 at 88.6%. GPT-5.5 scores 82.6% in the Vals AI harness. For scientific reasoning (GPQA Diamond), Gemini 3.1 Pro leads at 94.3%. In math (AIME 2025), GPT-5 Pro hits a perfect score. There's no single best model anymore. Fresh entry: The GPT-5.6 family (Sol, Terra, Luna) has been generally available since July 9, 2026. Sol hits 88.8% on Terminal-Bench 2.1 (Sol Ultra at 91.9%), 64.6% on SWE-Bench Pro (about 15 points behind Claude Mythos 5 and Fable 5), and 80 on the Artificial Analysis Coding Agent Index (Fable 5: 77.2).

What is the cheapest LLM?

The cheapest model with API access in our database is GPT-5 nano at $0.05 per 1M input tokens and $0.40 per 1M output tokens. The most expensive is GPT-5.5 Pro at $30 input and $180 output. That's a factor of 600 between the cheapest and the priciest model.

Are there more open or closed LLMs?

Of the 157 tracked models, 98 are proprietary and 59 are openly available (54 open-weights, 5 fully open-source). Closed models still lead on raw performance, but the open models are close behind: DeepSeek-V4-Pro reaches 80.6% SWE-bench, roughly 8 percentage points behind Claude Opus 4.8.

How large is the biggest context window?

The largest context windows belong to Llama 4 Scout and Qwen-Long at 10 million tokens each. Current frontier models mostly sit around 1 million tokens, with GPT-5.5 at 1 million and Claude Opus 5 plus Gemini 3.1 Pro at 1 million. For comparison: 10 million tokens is roughly 30 Harry Potter books.

Why do vendors no longer disclose parameter counts?

For every current frontier model from OpenAI, Anthropic, Google, and xAI, the parameter count is officially unknown. The labs treat model size and architecture as a trade secret. Concrete numbers almost only exist for open models like Kimi K3 (an official 2.8 trillion parameters, weights available since July 27, 2026 under Moonshot’s own “Kimi K3 License”), DeepSeek-V4-Pro (1.6 trillion), or Kimi K2.6 (1 trillion).

LLM Statistics 2026: Key Numbers, Data & Facts

Q: How many large language models are there in 2026?

Our centrally maintained database currently tracks 157 large language models from 23 providers, from GPT-2 (2019) to the current flagships. That's a curated selection of the most relevant models, not the total number of LLMs ever released. In 2025 alone, US labs shipped roughly 60 notable models according to the Stanford AI Index, China about 35.

Large language models are the heart of the AI revolution. But how many are there, really? Who builds them? What do they cost? And which model is actually the best?

The short answer:

It has gotten messy. In 2026, a new top-tier model shows up roughly every month, prices swing by a factor of 600, and the single most important metric of the past few years, the parameter count, is something the big labs no longer disclose at all.

In this article, I sort through the numbers. Every value comes from our centrally maintained LLM database, the same one that powers tools like the API cost calculator, and reflects the state of July 2026.

TL;DRKey Takeaways

Our database tracks 157 LLMs from 23 providers, 98 of them proprietary and 59 openly available.
For coding, Claude Fable 5 leads at 95.0% SWE-bench, ahead of Claude Opus 4.8 at 88.6%. GPT-5.5 scores 82.6% in the Vals AI harness. Open-weights models like DeepSeek-V4-Pro trail Claude Opus 4.8 by roughly 8 percentage points.
Prices range from $0.05 (GPT-5 nano) to $30 (GPT-5.5 Pro) per 1M input tokens. Frontier labs no longer disclose parameter counts.

Note

This article looks at a curated selection of the most relevant language models, not every LLM ever released. The figures for model count, providers, and type split are computed directly from our model database, so they always stay current.

1. How Many Large Language Models Are There in 2026?

Our database currently tracks 157 large language models from 23 different providers, from GPT-2 back in 2019 to the latest flagships in July 2026. This is deliberately a curated selection of the most important models, not a claim to completeness.

For context:

According to the Stanford AI Index 2026, US labs alone shipped around 60 notable models in 2025, Chinese providers about 35. More than 90% of all significant frontier models now come from industry rather than academic research. The market has professionalized and concentrated.

2. The Biggest LLM Providers by Model Count

A simple indicator of how active a lab is: the number of models it maintains. The chart below shows how many of the models we track belong to each provider:

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

OpenAI leads with 35 models, followed by Anthropic with 19 and Google with 18. That number only measures how deeply a lab maintains its lineup, though, not actual usage. Real market share looks different: in AI chatbot web traffic, ChatGPT dominates, while Gemini and Claude follow behind.

3. Parameters and Architecture: The End of Size Disclosures

For years, the parameter count was the most important metric for a model. GPT-3 had 175 billion, GPT-4 an estimated 1.76 trillion. Then the labs stopped reporting the number.

Today the rule is:

For every current frontier model from OpenAI, Anthropic, Google, and xAI, the parameter count is officially unknown. Model size has become a trade secret. Concrete, confirmed numbers almost only exist for open-weights models, and those are huge. The new leader is Moonshot AI's Kimi K3 at an official 2.8 trillion parameters (unveiled July 16, 2026, with weights released July 27 under Moonshot’s own “Kimi K3 License”):

Kimi K3MoE, 896 experts (16 active)

2.8T

DeepSeek-V4-ProMoE, 49B active

1.6T

Kimi K2.6MoE, 32B active

Qwen 3.6 Maxestimated

GLM-5.2MoE, 40B active

744B

DeepSeek V3.2MoE, 37B active

685B

Mistral Large 3MoE, 41B active

675B

Llama 4 MaverickMoE, 17B active

400B

Grok-1MoE (2024)

314B

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

The architecture is the striking part. Almost all large models today use a Mixture-of-Experts (MoE) design, where only a fraction of the parameters is active per request. DeepSeek-V4-Pro has 1.6 trillion parameters but activates only 49 billion per token, around 3%. That makes giant models affordable to run. In total, 44 of the tracked models are built as MoE.

You can filter and search the full parameter database by provider, size, and type below. For most current frontier models, the parameter column deliberately reads "unknown":

Legend:

500B+

100-500B

20-100B

5-20B

Under 5B

Showing 157 models

Parameter sizes of popular Large Language Models (as of May 2026)
Model	Developer	Parameters	Type	Released
GPT-5.6 Sol	OpenAI	Unknown	Proprietary	Jun 2026
GPT-5.6 Terra	OpenAI	Unknown	Proprietary	Jun 2026
GPT-5.6 Luna	OpenAI	Unknown	Proprietary	Jun 2026
GPT-5.5	OpenAI	Unknown	Proprietary	Apr 2026
GPT-5.5 Pro	OpenAI	Unknown	Proprietary	Apr 2026
GPT-5.5 Instant	OpenAI	Unknown	Proprietary	May 2026
ChatGPT chat-latest	OpenAI	Unknown	Proprietary	Jun 2026
GPT-5.4	OpenAI	Unknown	Proprietary	Mar 2026
GPT-5.4 Pro	OpenAI	Unknown	Proprietary	Mar 2026
GPT-5.4 mini	OpenAI	Unknown	Proprietary	Mar 2026
GPT-5.4 nano	OpenAI	Unknown	Proprietary	Mar 2026
GPT-5.3-Codex	OpenAI	Unknown	Proprietary	Feb 2026
GPT-5.3 Instant	OpenAI	Unknown	Proprietary	Feb 2026
GPT-5.2	OpenAI	Unknown	Proprietary	Dec 2025
GPT-5.1 Instant	OpenAI	Unknown	Proprietary	Nov 2025
GPT-5.1 Thinking	OpenAI	Unknown	Proprietary	Nov 2025
GPT-5	OpenAI	Unknown	Proprietary	Aug 2025
GPT-5 pro	OpenAI	Unknown	Proprietary	Aug 2025
GPT-5 mini	OpenAI	Unknown	Proprietary	Aug 2025
GPT-5 nano	OpenAI	Unknown	Proprietary	Aug 2025
GPT-4.1	OpenAI	Unknown	Proprietary	Apr 2025
GPT-4.1 mini	OpenAI	Unknown	Proprietary	Apr 2025
GPT-4.1 nano	OpenAI	Unknown	Proprietary	Apr 2025
GPT-3.5 Turbo	OpenAI	Unknown	Proprietary	Nov 2022
o3	OpenAI	Unknown	Proprietary	Apr 2025
o3-pro	OpenAI	Unknown	Proprietary	Jun 2025
o3-mini	OpenAI	Unknown	Proprietary	Jan 2025
o4-mini	OpenAI	Unknown	Proprietary	Apr 2025
o1	OpenAI	Unknown	Proprietary	Sep 2024
o1-mini	OpenAI	Unknown	Proprietary	Sep 2024
Claude Fable 5	Anthropic	Unknown	Proprietary	Jun 2026
Claude Mythos 5	Anthropic	Unknown	Proprietary	Jun 2026
Claude Sonnet 5	Anthropic	Unknown	Proprietary	Jun 2026
Claude Opus 5	Anthropic	Unknown	Proprietary	Jul 2026
Claude Opus 4.8	Anthropic	Unknown	Proprietary	May 2026
Claude Opus 4.7	Anthropic	Unknown	Proprietary	Apr 2026
Claude Opus 4.6	Anthropic	Unknown	Proprietary	Feb 2026
Claude Sonnet 4.6	Anthropic	Unknown	Proprietary	Feb 2026
Claude Opus 4.5	Anthropic	Unknown	Proprietary	Nov 2025
Claude Opus 4.1	Anthropic	Unknown	Proprietary	Aug 2025
Claude Sonnet 4.5	Anthropic	Unknown	Proprietary	Sep 2025
Claude Haiku 4.5	Anthropic	Unknown	Proprietary	Oct 2025
Claude Sonnet 4	Anthropic	Unknown	Proprietary	May 2025
Claude Opus 4	Anthropic	Unknown	Proprietary	May 2025
Claude Sonnet 3.7	Anthropic	Unknown	Proprietary	Feb 2025
Claude 3.5 Haiku	Anthropic	Unknown	Proprietary	Oct 2024
Gemini 3.6 Flash MoE	Google	Unknown	Proprietary	Jul 2026
Gemini 3.5 Flash MoE	Google	Unknown	Proprietary	May 2026
Gemini 3.1 Pro MoE	Google	Unknown	Proprietary	Feb 2026
Gemini 3 Flash MoE	Google	Unknown	Proprietary	Dec 2025
Gemini 3.5 Flash-Lite MoE	Google	Unknown	Proprietary	Jul 2026
Gemini 3.1 Flash-Lite MoE	Google	Unknown	Proprietary	Feb 2026
Gemini 2.5 Pro MoE	Google	Unknown	Proprietary	Mar 2025
Gemini 2.5 Flash MoE	Google	Unknown	Proprietary	Apr 2025
Gemini 2.5 Flash-Lite MoE	Google	Unknown	Proprietary	May 2025
Gemini 3 Pro MoE	Google	Unknown	Proprietary	Dec 2025
Gemini 2.0 Flash MoE	Google	Unknown	Proprietary	Dec 2024
Gemini 1.5 Pro MoE	Google	Unknown	Proprietary	Feb 2024
Grok 4.5	xAI	Unknown	Proprietary	Jul 2026
Grok 4.3	xAI	Unknown	Proprietary	Apr 2026
Grok Build 0.1	xAI	Unknown	Proprietary	Jun 2026
Grok 4	xAI	Unknown	Proprietary	Jul 2025
Grok 3	xAI	Unknown	Proprietary	Feb 2025
Grok 2	xAI	Unknown	Proprietary	Aug 2024
Mistral Medium 3.5	Mistral AI	Unknown	Proprietary	Jun 2026
Mistral Small 3	Mistral AI	Unknown	Open Weights	Jan 2025
MiniMax M3	MiniMax	Unknown	Proprietary	Jun 2026
Qwen 3.7 Max MoE	Alibaba	Unknown	Proprietary	May 2026
Qwen 3.7 Plus MoE	Alibaba	Unknown	Proprietary	Jun 2026
Nova 2 Lite	Amazon	Unknown	Proprietary	Dec 2025
Nova Premier 1.0	Amazon	Unknown	Proprietary	Apr 2025
Nova Pro 1.0	Amazon	Unknown	Proprietary	Dec 2024
Nova Lite 1.0	Amazon	Unknown	Proprietary	Dec 2024
Nova Micro 1.0	Amazon	Unknown	Proprietary	Dec 2024
Sonar	Perplexity	Unknown	Proprietary	Jan 2025
Sonar Pro	Perplexity	Unknown	Proprietary	Jan 2025
Sonar Reasoning Pro	Perplexity	Unknown	Proprietary	Feb 2025
Sonar Deep Research	Perplexity	Unknown	Proprietary	Feb 2025
MiMo-V2.5 MoE	Xiaomi	Unknown	Open Weights	Apr 2026
Solar Mini	Upstage	Unknown	Proprietary	Jan 2024
Kimi K3 MoE	Moonshot AI	2.8T	Proprietary	Jul 2026
Claude 3 Opus	Anthropic	2T*	Proprietary	Mar 2024
Llama 4 Behemoth MoE(288B active)	Meta	2T	Open Weights	Apr 2025
GPT-4 MoE(220B active)	OpenAI	1.76T*	Proprietary	Mar 2023
DeepSeek-V4-Pro MoE(49B active)	DeepSeek	1.6T	Open Weights	Apr 2026
Kimi K2.6 MoE(32B active)	Moonshot AI	1T	Open Weights	Apr 2026
Kimi K2.7 Code MoE(32B active)	Moonshot AI	1T	Open Weights	Jun 2026
Qwen 3.6 Max-Preview MoE	Alibaba	1T*	Proprietary	Apr 2026
Yi-Large MoE	01.AI	1T	Proprietary	May 2024
MiMo-V2.5-Pro MoE(42B active)	Xiaomi	1T	Open Weights	Apr 2026
MiMo-V2.5-Pro-UltraSpeed MoE(42B active)	Xiaomi	1T	Open Weights	Jun 2026
GLM-5.1 MoE	Z.ai	754B	Open Weights	Apr 2026
GLM-5.2 MoE	Z.ai	753B	Open Weights	Jun 2026
GLM-5 MoE(40B active)	Z.ai	744B	Open Weights	Feb 2026
DeepSeek-V3.2 MoE(37B active)	DeepSeek	685B	Open Weights	Dec 2025
Mistral Large 3 MoE(41B active)	Mistral AI	675B	Open Weights	Dec 2025
DeepSeek-V3 MoE(37B active)	DeepSeek	671B	Open Weights	Dec 2024
DeepSeek-R1 MoE(37B active)	DeepSeek	671B	Open Weights	Jan 2025
PaLM	Google	540B	Proprietary	Apr 2022
Megatron-Turing NLG	NVIDIA	530B	Proprietary	Jan 2022
Llama 3.1 405B	Meta	405B	Open Weights	Jul 2024
Llama 4 Maverick MoE(17B active)	Meta	400B	Open Weights	Apr 2025
Nemotron-4 340B	NVIDIA	340B	Open Weights	Jun 2024
PaLM 2	Google	340B*	Proprietary	May 2023
Grok 1 MoE(86B active)	xAI	314B	Open Weights	Nov 2023
DeepSeek-V4-Flash MoE(13B active)	DeepSeek	284B	Open Weights	Apr 2026
DeepSeek-V2 MoE(21B active)	DeepSeek	236B	Open Weights	May 2024
GPT-4o	OpenAI	200B*	Proprietary	May 2024
Step 3.7 Flash MoE(11B active)	StepFun	198B	Open Weights	May 2026
Step 3.5 Flash MoE(11B active)	StepFun	196.8B	Open Weights	Feb 2026
Falcon 180B	TII	180B	Open Weights	Sep 2023
Mixtral 8x22B MoE(44B active)	Mistral AI	176B	Open Weights	Apr 2024
BLOOM	BigScience	176B	Open Source	Jul 2022
GPT-3	OpenAI	175B	Proprietary	Jun 2020
Claude 3.5 Sonnet	Anthropic	175B*	Proprietary	Jun 2024
OPT-175B	Meta	175B	Open Source	May 2022
LaMDA	Google	137B	Proprietary	Jan 2022
DBRX MoE(36B active)	Databricks	132B	Open Weights	Mar 2024
Mistral Large 2	Mistral AI	123B	Open Weights	Jul 2024
Mistral Small 4 MoE(6B active)	Mistral AI	119B	Open Weights	Mar 2026
Command A	Cohere	111B	Proprietary	Mar 2025
Llama 4 Scout MoE(17B active)	Meta	109B	Open Weights	Apr 2025
Command R+	Cohere	104B	Open Weights	Apr 2024
Solar Pro 3 MoE	Upstage	102B	Proprietary	Jan 2026
Qwen 2.5 72B	Alibaba	72B	Open Weights	Sep 2024
Claude 3 Sonnet	Anthropic	70B*	Proprietary	Mar 2024
Llama 3.3 70B	Meta	70B	Open Weights	Dec 2024
Llama 3.1 70B	Meta	70B	Open Weights	Jul 2024
Llama 3 70B	Meta	70B	Open Weights	Apr 2024
Llama 2 70B	Meta	70B	Open Weights	Jul 2023
Mixtral 8x7B MoE(14B active)	Mistral AI	56B	Open Weights	Dec 2023
Falcon 40B	TII	40B	Open Source	May 2023
Yi-34B	01.AI	34B	Open Weights	Nov 2023
Qwen 2.5 32B	Alibaba	32B	Open Weights	Sep 2024
Command R	Cohere	32B	Open Weights	Mar 2024
Solar Pro 2	Upstage	31B	Proprietary	Jul 2025
Gemma 2 27B	Google	27B	Open Weights	Jun 2024
Claude 3 Haiku	Anthropic	20B*	Proprietary	Mar 2024
Qwen 2.5 14B	Alibaba	14B	Open Weights	Sep 2024
Phi-4	Microsoft	14B	Open Weights	Dec 2024
Gemma 2 9B	Google	9B	Open Weights	Jun 2024
GPT-4o mini	OpenAI	8B*	Proprietary	Jul 2024
Llama 3.1 8B	Meta	8B	Open Weights	Jul 2024
Llama 3 8B	Meta	8B	Open Weights	Apr 2024
Ministral 8B	Mistral AI	8B	Open Weights	Oct 2024
Mistral 7B	Mistral AI	7B	Open Source	Sep 2023
Qwen 2.5 7B	Alibaba	7B	Open Weights	Sep 2024
Command R7B	Cohere	7B	Open Weights	Dec 2024
Phi-4 Multimodal	Microsoft	5.6B	Open Weights	Feb 2025
Phi-4 mini	Microsoft	3.8B	Open Weights	Feb 2025
Phi-3 mini	Microsoft	3.8B	Open Weights	Apr 2024
Gemini Nano 2	Google	3.3B	Proprietary	Dec 2023
Ministral 3B	Mistral AI	3B	Open Weights	Oct 2024
Gemma 2 2B	Google	2B	Open Weights	Jul 2024
Gemini Nano 1	Google	1.8B	Proprietary	Dec 2023
GPT-2	OpenAI	1.5B	Open Source	Feb 2019
Qwen 2.5 0.5B	Alibaba	0.5B	Open Weights	Sep 2024

Parameter sizes of popular Large Language Models (as of May 2026)

4. Context Windows: From 200,000 to 10 Million Tokens

The context window determines how much text a model can process at once. Here the orders of magnitude have multiplied over the past two years. The overview below covers more than 140 current models, sortable and filterable by provider:

Legend:

1M+ Tokens

200K-1M Tokens

100K-200K Tokens

32K-100K Tokens

Under 32K Tokens

Showing 209 models

Context window sizes of current AI language models (as of May 2026)
Model	Developer	Context Window	Equivalent to
Llama 4 Scout	Meta	10M	≈ 25,000 pages (about 30 Harry Potter books)
Qwen-Long	Alibaba	10M	≈ 25,000 pages (about 30 Harry Potter books)
Gemini 2.0 Pro	Google	2M	≈ 5,000 pages (about 6 Harry Potter books)
Gemini 1.5 Pro	Google	2M	≈ 5,000 pages (about 6 Harry Potter books)
Grok 4.1 Fast	xAI	2M	≈ 5,000 pages (about 6 Harry Potter books)
Grok 4 Fast	xAI	2M	≈ 5,000 pages (about 6 Harry Potter books)
Llama 4 Maverick	Meta	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.6 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.5 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.1 Pro	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.5 Flash-Lite	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.1 Flash-Lite	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3 Pro	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Pro	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Flash-Lite	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.0 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Gemini 1.5 Flash	Google	1M	≈ 2,500 pages (about 3 Harry Potter books)
Grok 4.3	xAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Fable 5	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Mythos 5	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Sonnet 5	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 5	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.8	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.7	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.6	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
Claude Sonnet 4.6	Anthropic	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.6 Sol	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.6 Terra	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.6 Luna	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.5	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.5 Pro	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
ChatGPT chat-latest	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.4	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.4 Pro	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1 mini	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1 nano	OpenAI	1M	≈ 2,500 pages (about 3 Harry Potter books)
DeepSeek V4 Pro	DeepSeek	1M	≈ 2,500 pages (about 3 Harry Potter books)
DeepSeek V4 Flash	DeepSeek	1M	≈ 2,500 pages (about 3 Harry Potter books)
Kimi K3	Moonshot AI	1M	≈ 2,500 pages (about 3 Harry Potter books)
MiniMax M3	MiniMax	1M	≈ 2,500 pages (about 3 Harry Potter books)
Qwen 3.7 Max	Alibaba	1M	≈ 2,500 pages (about 3 Harry Potter books)
Qwen-Plus	Alibaba	1M	≈ 2,500 pages (about 3 Harry Potter books)
Qwen-Turbo	Alibaba	1M	≈ 2,500 pages (about 3 Harry Potter books)
Qwen 3.7 Plus	Alibaba	1M	≈ 2,500 pages (about 3 Harry Potter books)
GLM-5.2	Z.ai	1M	≈ 2,500 pages (about 3 Harry Potter books)
MiMo-V2.5	Xiaomi	1M	≈ 2,500 pages (about 3 Harry Potter books)
MiMo-V2.5-Pro	Xiaomi	1M	≈ 2,500 pages (about 3 Harry Potter books)
MiMo-V2.5-Pro-UltraSpeed	Xiaomi	1M	≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova Premier	Amazon	1M	≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova 2 Lite	Amazon	1M	≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova 2 Sonic	Amazon	1M	≈ 2,500 pages (about 3 Harry Potter books)
MiniMax-01	MiniMax	1M	≈ 2,500 pages (about 3 Harry Potter books)
Grok 4.5	xAI	500K	≈ 1,250 pages (about 5 novels)
GPT-5.5 Instant	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.4 mini	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.4 nano	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.3-Codex	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.3 Instant	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.2	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.1 Thinking	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.1 Instant	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.2 Pro	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5.1	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5 pro	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5 mini	OpenAI	400K	≈ 1,000 pages (about 4 novels)
GPT-5 nano	OpenAI	400K	≈ 1,000 pages (about 4 novels)
Amazon Nova Pro	Amazon	300K	≈ 750 pages (about 3 novels)
Amazon Nova Lite	Amazon	300K	≈ 750 pages (about 3 novels)
Kimi K2.6	Moonshot AI	262.14K	≈ 655 pages (about 2 novels)
Kimi K2.7 Code	Moonshot AI	262.14K	≈ 655 pages (about 2 novels)
Qwen 3.6 Max-Preview	Alibaba	262.14K	≈ 655 pages (about 2 novels)
Qwen3-Max	Alibaba	262.14K	≈ 655 pages (about 2 novels)
Grok Build 0.1	xAI	256K	≈ 640 pages (about 2 novels)
Grok 4.1	xAI	256K	≈ 640 pages (about 2 novels)
Grok 4	xAI	256K	≈ 640 pages (about 2 novels)
Mistral Large 3	Mistral	256K	≈ 640 pages (about 2 novels)
Mistral Medium 3.5	Mistral	256K	≈ 640 pages (about 2 novels)
Mistral Small 4	Mistral	256K	≈ 640 pages (about 2 novels)
Codestral Mamba	Mistral	256K	≈ 640 pages (about 2 novels)
Qwen3-235B-A22B (256K Update)	Alibaba	256K	≈ 640 pages (about 2 novels)
Step 3.5 Flash	StepFun	256K	≈ 640 pages (about 2 novels)
Step 3.7 Flash	StepFun	256K	≈ 640 pages (about 2 novels)
Command A	Cohere	256K	≈ 640 pages (about 2 novels)
Command A Reasoning	Cohere	256K	≈ 640 pages (about 2 novels)
Jamba 1.5 Large	AI21 Labs	256K	≈ 640 pages (about 2 novels)
Jamba 1.5 Mini	AI21 Labs	256K	≈ 640 pages (about 2 novels)
Jamba	AI21 Labs	256K	≈ 640 pages (about 2 novels)
abab6.5s	MiniMax	245.76K	≈ 614 pages (about 2 novels)
Claude Opus 4.5	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Sonnet 4.5	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Haiku 4.5	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Sonnet 4	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Opus 4	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Opus 4.1	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude Sonnet 3.7	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude 3.5 Sonnet	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude 3.5 Haiku	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude 3 Opus	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude 3 Sonnet	Anthropic	200K	≈ 500 pages (about 2 novels)
Claude 3 Haiku	Anthropic	200K	≈ 500 pages (about 2 novels)
o3	OpenAI	200K	≈ 500 pages (about 2 novels)
o3-pro	OpenAI	200K	≈ 500 pages (about 2 novels)
o4-mini	OpenAI	200K	≈ 500 pages (about 2 novels)
o3-mini	OpenAI	200K	≈ 500 pages (about 2 novels)
o1	OpenAI	200K	≈ 500 pages (about 2 novels)
GLM-5.1	Z.ai	200K	≈ 500 pages (about 2 novels)
GLM-5	Z.ai	200K	≈ 500 pages (about 2 novels)
Sonar Pro	Perplexity	200K	≈ 500 pages (about 2 novels)
Yi-34B-200K	01.AI	200K	≈ 500 pages (about 2 novels)
Yi-6B-200K	01.AI	200K	≈ 500 pages (about 2 novels)
Grok 3	xAI	131.07K	≈ 328 pages (about 1 novel)
Solar Pro 3	Upstage	131.07K	≈ 328 pages (about 1 novel)
Llama 3.3 70B	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.2 90B Vision	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.2 11B Vision	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.2 3B	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.2 1B	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.1 405B	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.1 70B	Meta	128K	≈ 320 pages (about 1 novel)
Llama 3.1 8B	Meta	128K	≈ 320 pages (about 1 novel)
Gemma 3 27B	Google	128K	≈ 320 pages (about 1 novel)
Gemma 3 12B	Google	128K	≈ 320 pages (about 1 novel)
Gemma 3 4B	Google	128K	≈ 320 pages (about 1 novel)
Grok 2	xAI	128K	≈ 320 pages (about 1 novel)
o1-mini	OpenAI	128K	≈ 320 pages (about 1 novel)
GPT-4.5	OpenAI	128K	≈ 320 pages (about 1 novel)
GPT-4o	OpenAI	128K	≈ 320 pages (about 1 novel)
GPT-4o mini	OpenAI	128K	≈ 320 pages (about 1 novel)
GPT-4 Turbo	OpenAI	128K	≈ 320 pages (about 1 novel)
DeepSeek V3.1	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek V3	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Llama 70B	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 32B	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 14B	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 7B	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Llama 8B	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek V2.5	DeepSeek	128K	≈ 320 pages (about 1 novel)
DeepSeek Coder V2	DeepSeek	128K	≈ 320 pages (about 1 novel)
Mistral Large 2	Mistral	128K	≈ 320 pages (about 1 novel)
Mistral Small 3	Mistral	128K	≈ 320 pages (about 1 novel)
Ministral 8B	Mistral	128K	≈ 320 pages (about 1 novel)
Ministral 3B	Mistral	128K	≈ 320 pages (about 1 novel)
Mistral NeMo	Mistral	128K	≈ 320 pages (about 1 novel)
Qwen3-235B-A22B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen3-32B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen3-14B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen3-8B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen3-30B-A3B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 72B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 32B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 14B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 7B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 32B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 14B	Alibaba	128K	≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 7B	Alibaba	128K	≈ 320 pages (about 1 novel)
Command R7B	Cohere	128K	≈ 320 pages (about 1 novel)
Sonar	Perplexity	128K	≈ 320 pages (about 1 novel)
Sonar Reasoning Pro	Perplexity	128K	≈ 320 pages (about 1 novel)
Sonar Deep Research	Perplexity	128K	≈ 320 pages (about 1 novel)
Command R+	Cohere	128K	≈ 320 pages (about 1 novel)
Command R	Cohere	128K	≈ 320 pages (about 1 novel)
Amazon Nova Micro	Amazon	128K	≈ 320 pages (about 1 novel)
Phi-4-mini	Microsoft	128K	≈ 320 pages (about 1 novel)
Phi-3.5-mini	Microsoft	128K	≈ 320 pages (about 1 novel)
Phi-3.5-MoE	Microsoft	128K	≈ 320 pages (about 1 novel)
Phi-3 Medium	Microsoft	128K	≈ 320 pages (about 1 novel)
Phi-3 Small	Microsoft	128K	≈ 320 pages (about 1 novel)
Phi-3 Mini	Microsoft	128K	≈ 320 pages (about 1 novel)
Yi-Coder 9B	01.AI	128K	≈ 320 pages (about 1 novel)
Yi-Coder 1.5B	01.AI	128K	≈ 320 pages (about 1 novel)
Llama-3.1-Nemotron-70B	Nvidia	128K	≈ 320 pages (about 1 novel)
Llama-3.1-Nemotron-51B	Nvidia	128K	≈ 320 pages (about 1 novel)
Mistral-NeMo-Minitron 8B	Nvidia	128K	≈ 320 pages (about 1 novel)
Reka Core	Reka	128K	≈ 320 pages (about 1 novel)
Reka Flash	Reka	128K	≈ 320 pages (about 1 novel)
Reka Edge	Reka	128K	≈ 320 pages (about 1 novel)
GLM-4	Zhipu AI	128K	≈ 320 pages (about 1 novel)
ChatGLM3-6B	Zhipu AI	128K	≈ 320 pages (about 1 novel)
ERNIE 4.0	Baidu	128K	≈ 320 pages (about 1 novel)
Mixtral 8x22B	Mistral	65.54K	≈ 164 pages
Solar Pro 2	Upstage	65.54K	≈ 164 pages
Phi-4-mini-flash-reasoning	Microsoft	64K	≈ 160 pages
Mixtral 8x7B	Mistral	32.77K	≈ 82 pages
Codestral	Mistral	32.77K	≈ 82 pages
Qwen3-4B	Alibaba	32.77K	≈ 82 pages
Qwen3-1.7B	Alibaba	32.77K	≈ 82 pages
Qwen3-0.6B	Alibaba	32.77K	≈ 82 pages
Solar Mini	Upstage	32.77K	≈ 82 pages
Phi-4-reasoning	Microsoft	32.77K	≈ 82 pages
DBRX	Databricks	32.77K	≈ 82 pages
Gemma 3 1B	Google	32K	≈ 80 pages
Yi-Large	01.AI	32K	≈ 80 pages
Phi-4	Microsoft	16.38K	≈ 41 pages
Yi-Zap	01.AI	16K	≈ 40 pages
Gemma 2 27B	Google	8.19K	≈ 20 pages
Gemma 2 9B	Google	8.19K	≈ 20 pages
GPT-4	OpenAI	8.19K	≈ 20 pages
Jurassic-2 Ultra	AI21 Labs	8.19K	≈ 20 pages
GLM-4V	Zhipu AI	8.19K	≈ 20 pages
ERNIE 3.5	Baidu	8K	≈ 20 pages
Command	Cohere	4.1K	≈ 10 pages
Nemotron-4 340B	Nvidia	4.1K	≈ 10 pages
StableLM 2 12B	Stability AI	4.1K	≈ 10 pages
StableLM Zephyr 3B	Stability AI	4.1K	≈ 10 pages

Context window sizes of current AI language models (as of May 2026)

At the top are Llama 4 Scout and Qwen-Long with 10 million tokens each. That's roughly 30 Harry Potter books in a single prompt. Current all-rounders mostly sit around 1 million tokens. GPT-5.5 is at 1 million, while Claude Opus 5 and Gemini 3.1 Pro are at 1 million. For more on the individual model families, see our overviews of the Claude models and Gemini models.

5. What Does an LLM Cost? Prices per 1 Million Tokens

API prices span worlds. The cheapest model with API access is GPT-5 nano at $0.05 per 1M input tokens. The most expensive is GPT-5.5 Pro at $30, a 600x difference.

More interesting than the raw price is the ratio of price to performance. The chart below plots input price against coding performance (SWE-bench Verified). Models toward the bottom right are ideal: strong and cheap.

Price-performance: SWE-bench vs. input price

Anthropic

OpenAI

Google

DeepSeek

Moonshot AI

Efficiency frontier (best price-performance)

Sources: gradually.ai LLM database (pricing + benchmarks)

CC BY 4.0

gradually.ai

The quiet star of this chart is DeepSeek-V4-Pro. At 80.6% SWE-bench for just $0.435 input price, it sits right on the efficiency frontier, no other model is both stronger and cheaper. So if you don't strictly need the last few percentage points of coding performance, the open models offer an extremely good price-performance ratio. For a detailed cost estimate of your specific usage, see the API cost calculator.

6. LLM Performance Head to Head

To make the strengths and weaknesses of the top models visible at a glance, the radar below compares five representative frontier models across four dimensions: reasoning, coding, context window, and price efficiency. Each axis is scaled relative to the five models so even small leads become visible. The real values appear in the tooltip.

Claude Opus 4.8

Gemini 3.1 Pro

Gemini 3.5 Flash

Claude Sonnet 4.6

GPT-5.5

Sources: Artificial Analysis, gradually.ai LLM database

CC BY 4.0

gradually.ai

The pattern is clear. Claude Opus 4.8 and GPT-5.5 dominate on raw coding performance but are expensive. Gemini 3.5 Flash flips that, nearly on par on reasoning and only trailing on coding, yet with the best price efficiency in the field. Every AI project comes down to this one trade-off in the end, maximum quality versus maximum economy.

7. Open Source vs. Proprietary

One of the most important developments of 2026 is the catch-up of open models. Of the 157 tracked models, 98 are proprietary and 59 are openly available, 54 of them open-weights and 5 fully open-source.

But at the very top:

According to the Stanford AI Index 2026, the best closed model led the best open-weights model by 3.3 percentage points in early 2026. In August 2024, the gap had been only 0.5 percentage points. So at the top it has not been shrinking but widening again, with six of the top-ten models in the Chatbot Arena now closed once more. Our data shows the same lead on coding: DeepSeek-V4-Pro (80.6% SWE-bench) and Kimi K2.6 (80.2%) trail Claude Opus 4.8 (88.6%) by about 8 percentage points. GPT-5.5 scores 82.6% in the Vals AI harness. For an overview of the best free models, see our article on open-source LLMs.

How the license mix breaks down by provider is shown below: column width represents the number of tracked models per provider, and the colors mark the license type.

Proprietary

Open-source

Open-weights

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

8. Knowledge Cutoff: How Current Are the Models?

Every model has a knowledge cutoff, after which it has learned nothing more about the world. Right now the freshest cutoff in our database is January 2026:

Claude Fable 5

Jan. 2026

Claude Opus 4.8

Jan. 2026

GPT-5.5

Dec. 2025

GPT-5.5 Instant

Dec. 2025

GPT-5.3 Codex

Aug. 2025

GPT-5.2

Aug. 2025

Claude Opus 4.6

May 2025

Claude Sonnet 4.6

May 2025

Claude Opus 4.5

Mar. 2025

Gemini 3.1 Pro

Jan. 2025

Gemini 3 Flash

Jan. 2025

Gemini 2.5 Pro

Jan. 2025

DeepSeek R1

Jan. 2025

DeepSeek V3.1

Dec. 2024

Grok 4.1

Nov. 2024

Qwen3-Max

Nov. 2024

Mistral Large 3

Oct. 2024

GPT-5

Oct. 2024

Llama 4 Scout

Aug. 2024

Gemini 2.0 Flash

Aug. 2024

Amazon Nova Pro

Aug. 2024

GPT-4.1

June 2024

GPT-5 mini

May 2024

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

Between the knowledge cutoff and the release date there are usually six to eight months in which the model is trained and tested. For current events, the models therefore almost always need a web search. Raw model knowledge is always a few months old.

9. Release Pace: The Cadence of the Labs

How fast the market moves shows in the release timeline. What happened quarterly in 2024 comes almost monthly in 2026:

May 2024

GPT-4o

OpenAI makes real-time multimodal models the default.

Jan. 2025

DeepSeek-R1

First open reasoning model at frontier level, kicking off the open-weights wave.

June 2025

GPT-5

OpenAI merges reasoning and standard mode into one model family.

Dec. 2025

Gemini 3 Pro

Google opens the third Gemini generation with its first model.

Dec. 2025

GPT-5.2

OpenAI follows up with an improved reasoning update.

Dec. 2025

Mistral Large 3

Mistral counters with an open MoE model from Europe.

Feb. 2026

Claude Opus 4.6

Anthropic raises the reasoning bar with the new Opus.

Feb. 2026

Gemini 3.1 Pro

Google takes the GPQA Diamond lead at 94.3%.

April 2026

GPT-5.5

Scores 82.6% SWE-bench Verified in the Vals AI harness.

April 2026

Claude Opus 4.7

Anthropic reaches 82.0% on coding, just behind GPT-5.5.

April 2026

DeepSeek-V4-Pro

Open model hits 80.6% SWE-bench at a fraction of the price.

May 2026

Claude Opus 4.8

Hits 88.6% SWE-bench, the active coding benchmark at the time.

May 2026

Gemini 3.5 Flash

Google ships a fast, price-efficient Flash model.

June 2026

Claude Fable 5

Anthropic expands the lineup with a specialized variant. Back online since July 1 after a June 12-30 export-control pause, now the new coding benchmark at 95.0% SWE-bench Verified.

June 2026

Claude Mythos 5

A second specialized model, available through the API at first.

July 2026

GPT-5.6 Sol/Terra/Luna GA

OpenAI makes the GPT-5.6 family generally available on July 9, according to Axios/Bloomberg one day after government restrictions were lifted. Sol default in Codex (Ultra mode there from Plus), Terra free for Free/Go only inside Codex so far, while GPT-5.5 Instant remains the regular chat default. Sol at 88.8% on Terminal-Bench 2.1 (Sol Ultra 91.9%), 64.6% on SWE-Bench Pro, 80 on the Coding Agent Index.

July 2026

Kimi K3

Moonshot AI unveils its 2.8-trillion-parameter MoE on July 16: 1M context, native vision, 93.5% on GPQA Diamond, API pricing $3/$15. Weights have been available since July 27 under Moonshot’s own “Kimi K3 License”, making it the largest open-weight model ever released.

July 2026

Gemini 3.6 Flash

Google ships the new, more efficient Flash generation on July 21: 17% fewer output tokens than the still-active 3.5 Flash, $7.50 instead of $9 per 1M output tokens, March 2026 knowledge cutoff. Gemini 3.5 Flash-Lite also launches for high-throughput tasks.

July 2026

Claude Opus 5

Anthropic releases Opus 5 on July 24, close to Fable 5 in performance but at half the price ($5/$25) and with the most recent knowledge cutoff of any Claude model (May 2026). Opus 4.8 is now considered superseded.

Plotting every tracked model onto its release month makes the clustering visible: the darker a cell, the more models shipped that month.

Releases: lowhigh(max 14)

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

December 2025 was especially dense, when Google, OpenAI, and Mistral all shipped new flagships in the same month. So was April 2026, which brought GPT-5.5, Claude Opus 4.7, DeepSeek-V4-Pro, Kimi K2.6, and Qwen 3.6 Max, five top models at once. If you want to keep up here, don't cling too tightly to individual version numbers.

10. Model Status: Active, Deprecated, Legacy

Not every model ever released is still usable. Across the three big providers Anthropic, Google, and OpenAI, we track the lifecycle of 87 models. Here is how they split across the individual statuses:

87models

Active4450.6%

Deprecated2731%

Legacy78%

Pro-exclusive33.4%

API only22.3%

Preview22.3%

Open source22.3%

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

Just over half of the models are still active, and nearly a third are already deprecated. And lifecycles are getting shorter. A good example is Gemini 3 Pro, deprecated only about three months after its release because Gemini 3.1 Pro was already standing by as a successor. Anyone building production systems on a model has to keep an active eye on these deprecations.

11. Market Position and Conclusion

The LLM market of 2026 has grown up. Instead of one dominant model, there's a tight leading pack of OpenAI, Anthropic, and Google, closely chased by open models from China, led by DeepSeek and Moonshot.

Bottom line:

Performance at the top is remarkably close together, and the competition is shifting to price, context length, and specialization. For most applications in 2026, it matters less which model is the absolute best and more which one is right for the specific purpose and budget. If you want to dig deeper into individual providers, you'll find the details in our statistics on OpenAI, Anthropic, Google Gemini, Grok, and DeepSeek.