Skip to main content
gradually.ai logogradually.ai
  • Blog
  • About Us
AI Newsletter
AI Newsletter
  1. Home
  2. AI Blog

LLM Statistics 2026: Key Numbers, Data & Facts

Current LLM statistics on models, providers, parameters, context windows, pricing, and benchmarks. Status June 2026, from centrally maintained data.

FHFinn Hillebrandt
June 16, 2026
Auf Deutsch lesen
AI Technology
LLM Statistics 2026: Key Numbers, Data & Facts
𝕏XShare on XFacebookShare on FacebookLinkedInShare on LinkedInPinterestShare on PinterestThreadsShare on ThreadsFlipboardShare on Flipboard
Links marked with * are affiliate links. If a purchase is made through such links, we receive a commission.

Large language models are the heart of the AI revolution. But how many are there, really? Who builds them? What do they cost? And which model is actually the best?

The honest answer:

It has gotten messy. In 2026, a new top-tier model shows up roughly every month, prices swing by a factor of 600, and the single most important metric of the past few years, the parameter count, is something the big labs no longer disclose at all.

In this article, I sort through the numbers. Every value comes from our centrally maintained LLM database, the same one that powers tools like the API cost calculator, and reflects the state of June 2026.

TL;DRKey Takeaways
  • Our database tracks 93 LLMs from 16 providers, 48 of them proprietary and 45 openly available.
  • For coding, GPT-5.5 and Claude Opus 4.8 lead at around 88.6% SWE-bench. Open-weights models like DeepSeek-V4-Pro trail by just 8 percentage points.
  • Prices range from $0.05 (GPT-5 nano) to $30 (GPT-5.5 Pro) per 1M input tokens. Frontier labs no longer disclose parameter counts.
Note
This article looks at a curated selection of the most relevant language models, not every LLM ever released. The figures for model count, providers, and type split are computed directly from our model database, so they always stay current.

1. How Many Large Language Models Are There in 2026?

Our database currently tracks 93 large language models from 16 different providers, from GPT-2 back in 2019 to the latest flagships in June 2026. This is deliberately a curated selection of the most important models, not a claim to completeness.

For context:

According to the Stanford AI Index 2026, US labs alone shipped around 50 notable models in 2025, Chinese providers about 30. More than 90% of all significant frontier models now come from industry rather than academic research. The market has professionalized and concentrated.

2. The Biggest LLM Providers by Model Count

A simple indicator of how active a lab is: the number of models it maintains. The chart below shows how many of the models we track belong to each provider:

Source: gradually.ai LLM database
|
CC BY 4.0
gradually.ai

OpenAI leads with 18 models, followed by Anthropic and Google with 13 each. That number only measures how deeply a lab maintains its lineup, though, not actual usage. Real market share looks different: in AI chatbot web traffic, ChatGPT dominates, while Gemini and Claude follow behind.

3. Parameters and Architecture: The End of Size Disclosures

For years, the parameter count was the most important metric for a model. GPT-3 had 175 billion, GPT-4 an estimated 1.76 trillion. Then the labs stopped reporting the number.

Today the rule is:

For every current frontier model from OpenAI, Anthropic, Google, and xAI, the parameter count is officially unknown. Model size has become a trade secret. Concrete, confirmed numbers only exist for open-weights models, and those are huge:

DeepSeek-V4-ProMoE, 49B active
1.6T
Kimi K2.6MoE, 32B active
1T
Qwen 3.6 Maxestimated
1T
DeepSeek V3.2MoE, 37B active
685B
Mistral Large 3MoE, 41B active
675B
Llama 4 MaverickMoE, 17B active
400B
Grok-1MoE (2024)
314B
Source: gradually.ai LLM database
|
CC BY 4.0
gradually.ai

The architecture is the striking part. Almost all large models today use a Mixture-of-Experts (MoE) design, where only a fraction of the parameters is active per request. DeepSeek-V4-Pro has 1.6 trillion parameters but activates only 49 billion per token, around 3%. That makes giant models affordable to run. In total, 22 of the tracked models are built as MoE.

You can filter and search the full parameter database by provider, size, and type below. For most current frontier models, the parameter column deliberately reads "unknown":

Legend:

500B+
100-500B
20-100B
5-20B
Under 5B

Showing 93 models

Parameter sizes of popular Large Language Models (as of May 2026)
Model
Developer
Parameters
Type
Released
GPT-5.5
OpenAI
Unknown
ProprietaryApr 2026
GPT-5.5 Pro
OpenAI
Unknown
ProprietaryApr 2026
GPT-5.5 Instant
OpenAI
Unknown
ProprietaryMay 2026
GPT-5.4
OpenAI
Unknown
ProprietaryMar 2026
GPT-5.3-Codex
OpenAI
Unknown
ProprietaryFeb 2026
GPT-5.2
OpenAI
Unknown
ProprietaryDec 2025
GPT-5
OpenAI
Unknown
ProprietaryJun 2025
GPT-5 pro
OpenAI
Unknown
ProprietaryJun 2025
GPT-5 mini
OpenAI
Unknown
ProprietaryJun 2025
GPT-3.5 Turbo
OpenAI
Unknown
ProprietaryNov 2022
o3
OpenAI
Unknown
ProprietaryApr 2025
o4-mini
OpenAI
Unknown
ProprietaryApr 2025
o1
OpenAI
Unknown
ProprietarySep 2024
Claude Fable 5
Anthropic
Unknown
ProprietaryJun 2026
Claude Mythos 5
Anthropic
Unknown
ProprietaryJun 2026
Claude Opus 4.8
Anthropic
Unknown
ProprietaryMay 2026
Claude Opus 4.7
Anthropic
Unknown
ProprietaryApr 2026
Claude Opus 4.6
Anthropic
Unknown
ProprietaryFeb 2026
Claude Sonnet 4.6
Anthropic
Unknown
ProprietaryFeb 2026
Claude Opus 4.5
Anthropic
Unknown
ProprietaryNov 2025
Claude Sonnet 4.5
Anthropic
Unknown
ProprietarySep 2025
Claude Sonnet 4
Anthropic
Unknown
ProprietaryMay 2025
Gemini 3.5 Flash
MoE
Google
Unknown
ProprietaryMay 2026
Gemini 3.1 Pro
MoE
Google
Unknown
ProprietaryFeb 2026
Gemini 3 Pro
MoE
Google
Unknown
ProprietaryDec 2025
Gemini 2.0 Flash
MoE
Google
Unknown
ProprietaryDec 2024
Gemini 1.5 Pro
MoE
Google
Unknown
ProprietaryFeb 2024
Grok 4
xAI
Unknown
ProprietaryJul 2025
Grok 3
xAI
Unknown
ProprietaryFeb 2025
Grok 2
xAI
Unknown
ProprietaryAug 2024
Claude 3 Opus
Anthropic
2T*
ProprietaryMar 2024
Llama 4 Behemoth
MoE(288B active)
Meta
2T
Open WeightsApr 2025
GPT-4
MoE(220B active)
OpenAI
1.76T*
ProprietaryMar 2023
DeepSeek-V4-Pro
MoE(49B active)
DeepSeek
1.6T
Open WeightsApr 2026
Kimi K2.6
MoE(32B active)
Moonshot AI
1T
Open WeightsApr 2026
Qwen 3.6 Max-Preview
MoE
Alibaba
1T*
ProprietaryApr 2026
Yi-Large
MoE
01.AI
1T
ProprietaryMay 2024
DeepSeek-V3.2
MoE(37B active)
DeepSeek
685B
Open WeightsDec 2025
Mistral Large 3
MoE(41B active)
Mistral AI
675B
ProprietaryDec 2025
DeepSeek-V3
MoE(37B active)
DeepSeek
671B
Open WeightsDec 2024
DeepSeek-R1
MoE(37B active)
DeepSeek
671B
Open WeightsJan 2025
PaLM
Google
540B
ProprietaryApr 2022
Megatron-Turing NLG
NVIDIA
530B
ProprietaryJan 2022
Llama 3.1 405B
Meta
405B
Open WeightsJul 2024
Llama 4 Maverick
MoE(17B active)
Meta
400B
Open WeightsApr 2025
Nemotron-4 340B
NVIDIA
340B
Open WeightsJun 2024
PaLM 2
Google
340B*
ProprietaryMay 2023
Grok 1
MoE(86B active)
xAI
314B
Open WeightsNov 2023
DeepSeek-V2
MoE(21B active)
DeepSeek
236B
Open WeightsMay 2024
GPT-4o
OpenAI
200B*
ProprietaryMay 2024
Falcon 180B
TII
180B
Open WeightsSep 2023
Mixtral 8x22B
MoE(44B active)
Mistral AI
176B
Open WeightsApr 2024
BLOOM
BigScience
176B
Open SourceJul 2022
GPT-3
OpenAI
175B
ProprietaryJun 2020
Claude 3.5 Sonnet
Anthropic
175B*
ProprietaryJun 2024
OPT-175B
Meta
175B
Open SourceMay 2022
LaMDA
Google
137B
ProprietaryJan 2022
DBRX
MoE(36B active)
Databricks
132B
Open WeightsMar 2024
Mistral Large 2
Mistral AI
123B
Open WeightsJul 2024
Command A
Cohere
111B
ProprietaryMar 2025
Llama 4 Scout
MoE(17B active)
Meta
109B
Open WeightsApr 2025
Command R+
Cohere
104B
Open WeightsApr 2024
Qwen 2.5 72B
Alibaba
72B
Open WeightsSep 2024
Claude 3 Sonnet
Anthropic
70B*
ProprietaryMar 2024
Llama 3.3 70B
Meta
70B
Open WeightsDec 2024
Llama 3.1 70B
Meta
70B
Open WeightsJul 2024
Llama 3 70B
Meta
70B
Open WeightsApr 2024
Llama 2 70B
Meta
70B
Open WeightsJul 2023
Mixtral 8x7B
MoE(14B active)
Mistral AI
56B
Open WeightsDec 2023
Falcon 40B
TII
40B
Open SourceMay 2023
Yi-34B
01.AI
34B
Open WeightsNov 2023
Qwen 2.5 32B
Alibaba
32B
Open WeightsSep 2024
Command R
Cohere
32B
Open WeightsMar 2024
Gemma 2 27B
Google
27B
Open WeightsJun 2024
Claude 3 Haiku
Anthropic
20B*
ProprietaryMar 2024
Qwen 2.5 14B
Alibaba
14B
Open WeightsSep 2024
Phi-4
Microsoft
14B
Open WeightsDec 2024
Gemma 2 9B
Google
9B
Open WeightsJun 2024
GPT-4o mini
OpenAI
8B*
ProprietaryJul 2024
Llama 3.1 8B
Meta
8B
Open WeightsJul 2024
Llama 3 8B
Meta
8B
Open WeightsApr 2024
Ministral 8B
Mistral AI
8B
Open WeightsOct 2024
Mistral 7B
Mistral AI
7B
Open SourceSep 2023
Qwen 2.5 7B
Alibaba
7B
Open WeightsSep 2024
Phi-4 Multimodal
Microsoft
5.6B
Open WeightsFeb 2025
Phi-4 mini
Microsoft
3.8B
Open WeightsFeb 2025
Phi-3 mini
Microsoft
3.8B
Open WeightsApr 2024
Gemini Nano 2
Google
3.3B
ProprietaryDec 2023
Ministral 3B
Mistral AI
3B
Open WeightsOct 2024
Gemma 2 2B
Google
2B
Open WeightsJul 2024
Gemini Nano 1
Google
1.8B
ProprietaryDec 2023
GPT-2
OpenAI
1.5B
Open SourceFeb 2019
Qwen 2.5 0.5B
Alibaba
0.5B
Open WeightsSep 2024

Parameter sizes of popular Large Language Models (as of May 2026)

4. Context Windows: From 200,000 to 10 Million Tokens

The context window determines how much text a model can process at once. Here the orders of magnitude have multiplied over the past two years. The overview below covers more than 140 current models, sortable and filterable by provider:

Legend:
1M+ Tokens
200K-1M Tokens
100K-200K Tokens
32K-100K Tokens
Under 32K Tokens
Showing 165 models
Context window sizes of current AI language models (as of May 2026)
Model
Developer
Context Window
Equivalent to
Llama 4 Scout
Meta
10M
≈ 25,000 pages (about 30 Harry Potter books)
Qwen-Long
Alibaba
10M
≈ 25,000 pages (about 30 Harry Potter books)
Gemini 2.0 Pro
Google
2M
≈ 5,000 pages (about 6 Harry Potter books)
Gemini 1.5 Pro
Google
2M
≈ 5,000 pages (about 6 Harry Potter books)
Grok 4.1 Fast
xAI
2M
≈ 5,000 pages (about 6 Harry Potter books)
Grok 4 Fast
xAI
2M
≈ 5,000 pages (about 6 Harry Potter books)
Llama 4 Maverick
Meta
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.5 Flash
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3.1 Pro
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3 Pro
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 3 Flash
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Pro
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Flash
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.5 Flash-Lite
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 2.0 Flash
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Gemini 1.5 Flash
Google
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Fable 5
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Mythos 5
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.8
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.7
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Opus 4.6 (1M Beta)
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Sonnet 4.6 (1M Beta)
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Sonnet 4.5 (1M Beta)
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
Claude Sonnet 4 (1M Beta)
Anthropic
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.5
OpenAI
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.5 Pro
OpenAI
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1
OpenAI
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1 mini
OpenAI
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-4.1 nano
OpenAI
1M
≈ 2,500 pages (about 3 Harry Potter books)
DeepSeek V4 Pro
DeepSeek
1M
≈ 2,500 pages (about 3 Harry Potter books)
Qwen-Plus
Alibaba
1M
≈ 2,500 pages (about 3 Harry Potter books)
Qwen-Turbo
Alibaba
1M
≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova Premier
Amazon
1M
≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova 2 Lite
Amazon
1M
≈ 2,500 pages (about 3 Harry Potter books)
Amazon Nova 2 Sonic
Amazon
1M
≈ 2,500 pages (about 3 Harry Potter books)
MiniMax-01
MiniMax
1M
≈ 2,500 pages (about 3 Harry Potter books)
GPT-5.5 Instant
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5.3-Codex
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5.2
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5.2 Pro
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5.1
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5 mini
OpenAI
400K
≈ 1,000 pages (about 4 novels)
GPT-5 nano
OpenAI
400K
≈ 1,000 pages (about 4 novels)
Amazon Nova Pro
Amazon
300K
≈ 750 pages (about 3 novels)
Amazon Nova Lite
Amazon
300K
≈ 750 pages (about 3 novels)
Kimi K2.6
Moonshot AI
262.14K
≈ 655 pages (about 2 novels)
Qwen 3.6 Max-Preview
Alibaba
262.14K
≈ 655 pages (about 2 novels)
Qwen3-Max
Alibaba
262.14K
≈ 655 pages (about 2 novels)
Grok 4.1
xAI
256K
≈ 640 pages (about 2 novels)
Grok 4
xAI
256K
≈ 640 pages (about 2 novels)
Mistral Large 3
Mistral
256K
≈ 640 pages (about 2 novels)
Codestral Mamba
Mistral
256K
≈ 640 pages (about 2 novels)
Qwen3-235B-A22B (256K Update)
Alibaba
256K
≈ 640 pages (about 2 novels)
Command A
Cohere
256K
≈ 640 pages (about 2 novels)
Command A Reasoning
Cohere
256K
≈ 640 pages (about 2 novels)
Jamba 1.5 Large
AI21 Labs
256K
≈ 640 pages (about 2 novels)
Jamba 1.5 Mini
AI21 Labs
256K
≈ 640 pages (about 2 novels)
Jamba
AI21 Labs
256K
≈ 640 pages (about 2 novels)
abab6.5s
MiniMax
245.76K
≈ 614 pages (about 2 novels)
Claude Opus 4.6
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude Sonnet 4.6
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude Opus 4.5
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude Sonnet 4.5
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude Sonnet 4
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude Opus 4
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude 3.5 Sonnet
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude 3.5 Haiku
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude 3 Opus
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude 3 Sonnet
Anthropic
200K
≈ 500 pages (about 2 novels)
Claude 3 Haiku
Anthropic
200K
≈ 500 pages (about 2 novels)
o3
OpenAI
200K
≈ 500 pages (about 2 novels)
o4-mini
OpenAI
200K
≈ 500 pages (about 2 novels)
o3-mini
OpenAI
200K
≈ 500 pages (about 2 novels)
o1
OpenAI
200K
≈ 500 pages (about 2 novels)
Yi-34B-200K
01.AI
200K
≈ 500 pages (about 2 novels)
Yi-6B-200K
01.AI
200K
≈ 500 pages (about 2 novels)
Grok 3
xAI
131.07K
≈ 328 pages (about 1 novel)
Llama 3.3 70B
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.2 90B Vision
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.2 11B Vision
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.2 3B
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.2 1B
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.1 405B
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.1 70B
Meta
128K
≈ 320 pages (about 1 novel)
Llama 3.1 8B
Meta
128K
≈ 320 pages (about 1 novel)
Gemma 3 27B
Google
128K
≈ 320 pages (about 1 novel)
Gemma 3 12B
Google
128K
≈ 320 pages (about 1 novel)
Gemma 3 4B
Google
128K
≈ 320 pages (about 1 novel)
Grok 2
xAI
128K
≈ 320 pages (about 1 novel)
o1-mini
OpenAI
128K
≈ 320 pages (about 1 novel)
GPT-4.5
OpenAI
128K
≈ 320 pages (about 1 novel)
GPT-4o
OpenAI
128K
≈ 320 pages (about 1 novel)
GPT-4o mini
OpenAI
128K
≈ 320 pages (about 1 novel)
GPT-4 Turbo
OpenAI
128K
≈ 320 pages (about 1 novel)
DeepSeek V3.1
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek V3
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Llama 70B
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 32B
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 14B
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Qwen 7B
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek R1 Distill Llama 8B
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek V2.5
DeepSeek
128K
≈ 320 pages (about 1 novel)
DeepSeek Coder V2
DeepSeek
128K
≈ 320 pages (about 1 novel)
Mistral Large 2
Mistral
128K
≈ 320 pages (about 1 novel)
Mistral Small 3
Mistral
128K
≈ 320 pages (about 1 novel)
Ministral 8B
Mistral
128K
≈ 320 pages (about 1 novel)
Ministral 3B
Mistral
128K
≈ 320 pages (about 1 novel)
Mistral NeMo
Mistral
128K
≈ 320 pages (about 1 novel)
Qwen3-235B-A22B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen3-32B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen3-14B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen3-8B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen3-30B-A3B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 72B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 32B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 14B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 7B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 32B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 14B
Alibaba
128K
≈ 320 pages (about 1 novel)
Qwen 2.5 Coder 7B
Alibaba
128K
≈ 320 pages (about 1 novel)
Command R+
Cohere
128K
≈ 320 pages (about 1 novel)
Command R
Cohere
128K
≈ 320 pages (about 1 novel)
Amazon Nova Micro
Amazon
128K
≈ 320 pages (about 1 novel)
Phi-4-mini
Microsoft
128K
≈ 320 pages (about 1 novel)
Phi-3.5-mini
Microsoft
128K
≈ 320 pages (about 1 novel)
Phi-3.5-MoE
Microsoft
128K
≈ 320 pages (about 1 novel)
Phi-3 Medium
Microsoft
128K
≈ 320 pages (about 1 novel)
Phi-3 Small
Microsoft
128K
≈ 320 pages (about 1 novel)
Phi-3 Mini
Microsoft
128K
≈ 320 pages (about 1 novel)
Yi-Coder 9B
01.AI
128K
≈ 320 pages (about 1 novel)
Yi-Coder 1.5B
01.AI
128K
≈ 320 pages (about 1 novel)
Llama-3.1-Nemotron-70B
Nvidia
128K
≈ 320 pages (about 1 novel)
Llama-3.1-Nemotron-51B
Nvidia
128K
≈ 320 pages (about 1 novel)
Mistral-NeMo-Minitron 8B
Nvidia
128K
≈ 320 pages (about 1 novel)
Reka Core
Reka
128K
≈ 320 pages (about 1 novel)
Reka Flash
Reka
128K
≈ 320 pages (about 1 novel)
Reka Edge
Reka
128K
≈ 320 pages (about 1 novel)
GLM-4
Zhipu AI
128K
≈ 320 pages (about 1 novel)
ChatGLM3-6B
Zhipu AI
128K
≈ 320 pages (about 1 novel)
ERNIE 4.0
Baidu
128K
≈ 320 pages (about 1 novel)
Mixtral 8x22B
Mistral
65.54K
≈ 164 pages
Phi-4-mini-flash-reasoning
Microsoft
64K
≈ 160 pages
Mixtral 8x7B
Mistral
32.77K
≈ 82 pages
Codestral
Mistral
32.77K
≈ 82 pages
Qwen3-4B
Alibaba
32.77K
≈ 82 pages
Qwen3-1.7B
Alibaba
32.77K
≈ 82 pages
Qwen3-0.6B
Alibaba
32.77K
≈ 82 pages
Phi-4-reasoning
Microsoft
32.77K
≈ 82 pages
DBRX
Databricks
32.77K
≈ 82 pages
Gemma 3 1B
Google
32K
≈ 80 pages
Yi-Large
01.AI
32K
≈ 80 pages
Phi-4
Microsoft
16.38K
≈ 41 pages
Yi-Zap
01.AI
16K
≈ 40 pages
Gemma 2 27B
Google
8.19K
≈ 20 pages
Gemma 2 9B
Google
8.19K
≈ 20 pages
GPT-4
OpenAI
8.19K
≈ 20 pages
Jurassic-2 Ultra
AI21 Labs
8.19K
≈ 20 pages
GLM-4V
Zhipu AI
8.19K
≈ 20 pages
ERNIE 3.5
Baidu
8K
≈ 20 pages
Command
Cohere
4.1K
≈ 10 pages
Nemotron-4 340B
Nvidia
4.1K
≈ 10 pages
StableLM 2 12B
Stability AI
4.1K
≈ 10 pages
StableLM Zephyr 3B
Stability AI
4.1K
≈ 10 pages

Context window sizes of current AI language models (as of May 2026)

At the top are Llama 4 Scout and Qwen-Long with 10 million tokens each. That's roughly 30 Harry Potter books in a single prompt. The current all-rounders like GPT-5.5, Claude Opus 4.8, and Gemini 3.1 Pro sit at 1 million tokens, which is more than enough for most use cases. For more on the individual model families, see our overviews of the Claude models and Gemini models.

5. What Does an LLM Cost? Prices per 1 Million Tokens

API prices span worlds. The cheapest model with API access is GPT-5 nano at $0.05 per 1M input tokens. The most expensive is GPT-5.5 Pro at $30, a 600x difference.

More interesting than the raw price is the ratio of price to performance. The chart below plots input price against coding performance (SWE-bench Verified). Models toward the bottom right are ideal: strong and cheap.

Price-performance: SWE-bench vs. input price
OpenAI
Anthropic
Google
DeepSeek
Moonshot AI
Efficiency frontier (best price-performance)
Sources: gradually.ai LLM database (pricing + benchmarks)
|
CC BY 4.0
gradually.ai

The quiet star of this chart is DeepSeek-V4-Pro. At 80.6% SWE-bench for just $0.435 input price, it sits right on the efficiency frontier, no other model is both stronger and cheaper. So if you don't strictly need the last few percentage points of coding performance, the open models offer an extremely good price-performance ratio. For a detailed cost estimate of your specific usage, see the API cost calculator.

6. LLM Performance Head to Head

To make the strengths and weaknesses of the top models visible at a glance, the radar below compares five representative frontier models across four dimensions: reasoning, coding, context window, and price efficiency. Each axis is scaled relative to the five models so even small leads become visible. The real values appear in the tooltip.

Claude Opus 4.8
Gemini 3.1 Pro
Gemini 3.5 Flash
Claude Sonnet 4.6
GPT-5.5
Sources: Artificial Analysis (GPQA Diamond, as of 2026-06-17), gradually.ai LLM database (SWE-bench, context, pricing)
|
CC BY 4.0
gradually.ai

The pattern is clear. Claude Opus 4.8 and GPT-5.5 dominate on raw coding performance but are expensive. Gemini 3.5 Flash flips that, nearly on par on reasoning and only trailing on coding, yet with the best price efficiency in the field. Every AI project comes down to this one trade-off in the end, maximum quality versus maximum economy.

7. Open Source vs. Proprietary

One of the most important developments of 2026 is the catch-up of open models. Of the 93 tracked models, 48 are proprietary and 45 are openly available, 40 of them open-weights and 5 fully open-source.

But at the very top:

According to the Stanford AI Index 2026, the best closed model led the best open-weights model by 3.3 percentage points in early 2026. In August 2024, the gap had been only 0.5 percentage points. So at the top it has not been shrinking but widening again, with six of the top-ten models in the Chatbot Arena now closed once more. Our data shows the same lead on coding: DeepSeek-V4-Pro (80.6% SWE-bench) and Kimi K2.6 (80.2%) trail the closed leader GPT-5.5 (88.7%) by about 8 percentage points. For an overview of the best free models, see our article on open-source LLMs.

8. Knowledge Cutoff: How Current Are the Models?

Every model has a knowledge cutoff, after which it has learned nothing more about the world. Right now the freshest cutoff in our database is October 2025:

GPT-5.5 Instant
Oct. 2025
GPT-5.3 Codex
Oct. 2025
GPT-5.2
Aug. 2025
Claude Opus 4.6
May 2025
Claude Sonnet 4.6
May 2025
Claude Opus 4.5
Mar. 2025
Gemini 3.1 Pro
Jan. 2025
Gemini 3 Flash
Jan. 2025
Gemini 2.5 Pro
Jan. 2025
DeepSeek R1
Jan. 2025
DeepSeek V3.1
Dec. 2024
Grok 4.1
Nov. 2024
Qwen3-Max
Nov. 2024
Mistral Large 3
Oct. 2024
GPT-5
Oct. 2024
Llama 4 Scout
Aug. 2024
Gemini 2.0 Flash
Aug. 2024
Amazon Nova Pro
Aug. 2024
GPT-4.1
June 2024
GPT-5 mini
May 2024
Source: gradually.ai LLM database
|
CC BY 4.0
gradually.ai

Between the knowledge cutoff and the release date there are usually six to eight months in which the model is trained and tested. For current events, the models therefore almost always need a web search. Raw model knowledge is always a few months old.

9. Release Pace: The Cadence of the Labs

How fast the market moves shows in the release timeline. What happened quarterly in 2024 comes almost monthly in 2026:

May 2024
GPT-4o
OpenAI makes real-time multimodal models the default.
Jan. 2025
DeepSeek-R1
First open reasoning model at frontier level, kicking off the open-weights wave.
June 2025
GPT-5
OpenAI merges reasoning and standard mode into one model family.
Dec. 2025
Gemini 3 Pro
Google opens the third Gemini generation with its first model.
Dec. 2025
GPT-5.2
OpenAI follows up with an improved reasoning update.
Dec. 2025
Mistral Large 3
Mistral counters with an open MoE model from Europe.
Feb. 2026
Claude Opus 4.6
Anthropic raises the reasoning bar with the new Opus.
Feb. 2026
Gemini 3.1 Pro
Google takes the GPQA Diamond lead at 94.3%.
April 2026
GPT-5.5
Sets the new coding record at 88.7% SWE-bench.
April 2026
Claude Opus 4.7
Anthropic stays right behind GPT-5.5 on coding.
April 2026
DeepSeek-V4-Pro
Open model hits 80.6% SWE-bench at a fraction of the price.
May 2026
Claude Opus 4.8
Hits 88.6% SWE-bench, effectively level with GPT-5.5.
May 2026
Gemini 3.5 Flash
Google ships a fast, price-efficient Flash model.
June 2026
Claude Fable 5
Anthropic expands the lineup with a specialized variant.
June 2026
Claude Mythos 5
A second specialized model, available through the API at first.

December 2025 was especially dense, when Google, OpenAI, and Mistral all shipped new flagships in the same month. So was April 2026, which brought GPT-5.5, Claude Opus 4.7, DeepSeek-V4-Pro, Kimi K2.6, and Qwen 3.6 Max, five top models at once. If you want to keep up here, don't cling too tightly to individual version numbers.

10. Model Status: Active, Deprecated, Legacy

Not every model ever released is still usable. Across the three big providers Anthropic, Google, and OpenAI, we track the lifecycle of 77 models. Here is how they split across the individual statuses:

77models
Active3950.6%
Deprecated2431.2%
Legacy67.8%
Pro-exclusive33.9%
API only22.6%
Preview11.3%
Open source22.6%
Source: gradually.ai LLM database
|
CC BY 4.0
gradually.ai

Just over half of the models are still active, and nearly a third are already deprecated. And lifecycles are getting shorter. A good example is Gemini 3 Pro, deprecated only about three months after its release because Gemini 3.1 Pro was already standing by as a successor. Anyone building production systems on a model has to keep an active eye on these deprecations.

11. Market Position and Conclusion

The LLM market of 2026 has grown up. Instead of one dominant model, there's a tight leading pack of OpenAI, Anthropic, and Google, closely chased by open models from China, led by DeepSeek and Moonshot.

Bottom line:

Performance at the top is remarkably close together, and the competition is shifting to price, context length, and specialization. For most applications in 2026, it matters less which model is the absolute best and more which one is right for the specific purpose and budget. If you want to dig deeper into individual providers, you'll find the details in our statistics on OpenAI, Anthropic, Google Gemini, Grok, and DeepSeek.

Frequently Asked Questions

𝕏XShare on XFacebookShare on FacebookLinkedInShare on LinkedInPinterestShare on PinterestThreadsShare on ThreadsFlipboardShare on Flipboard
FH

Finn Hillebrandt

AI Expert & Blogger

Finn Hillebrandt is the founder of Gradually AI, an SEO and AI expert. He helps online entrepreneurs simplify and automate their processes and marketing with AI. Finn shares his knowledge here on the blog in 50+ articles as well as through his ChatGPT Course and the AI Business Club.

Learn more about Finn and the team, follow Finn on LinkedIn, join his Facebook group for ChatGPT, OpenAI & AI Tools or do like 17,500+ others and subscribe to his AI Newsletter with tips, news and offers about AI tools and online business. Also visit his other blog, Blogmojo, which is about WordPress, blogging and SEO.

Similar Articles

Anthropic IPO 2026: Valuation, Stock & Date
AI Technology

Anthropic IPO 2026: Valuation, Stock & Date

June 17, 2026
FHFinn Hillebrandt
OpenAI IPO 2026: Valuation, Stock & How to Buy
AI Technology

OpenAI IPO 2026: Valuation, Stock & How to Buy

June 17, 2026
FHFinn Hillebrandt
SpaceX IPO 2026: Valuation, Stock & Share Price
AI Technology

SpaceX IPO 2026: Valuation, Stock & Share Price

June 17, 2026
FHFinn Hillebrandt
SpaceX Statistics 2026: Key Numbers, Data & Facts
AI Technology

SpaceX Statistics 2026: Key Numbers, Data & Facts

June 16, 2026
FHFinn Hillebrandt
Claude Statistics 2026: Numbers, Data & Facts About Anthropic
AI Technology

Claude Statistics 2026: Numbers, Data & Facts About Anthropic

June 14, 2026
FHFinn Hillebrandt
DeepSeek Statistics 2026: Key Numbers, Data & Facts
AI Technology

DeepSeek Statistics 2026: Key Numbers, Data & Facts

June 14, 2026
FHFinn Hillebrandt

Stay Updated with the AI Newsletter

Get the latest AI tools, tutorials, and exclusive tips delivered to your inbox weekly

Unsubscribe anytime. About 4 to 8 emails per month. Consent includes notes on revocation, service provider, and statistics according to our Privacy Policy.

gradually.ai logogradually.ai

Germany's leading platform for AI tools and knowledge for online entrepreneurs.

AI Tools

  • Text Generator
  • Prompt Enhancer
  • Prompt Link Generator
  • FLUX AI Image Generator
  • AI Art Generator
  • Midjourney Prompt Generator
  • Veo 3 Prompt Generator
  • AI Humanizer
  • AI Text Detector
  • Gemini Watermark Remover
  • All Tools →

Creative Tools

  • Blog Name Generator
  • AI Book Title Generator
  • Song Lyrics Generator
  • Artist Name Generator
  • Team Name Generator
  • AI Mindmap Generator
  • Headline Generator
  • Company Name Generator
  • AI Slogan Generator
  • Brand Name Generator
  • Newsletter Name Generator
  • YouTube Channel Name Generator

Business Tools

  • API Cost Calculator
  • Token Counter
  • AI Ad Generator
  • AI Copy Generator
  • Essay Generator
  • Story Generator
  • AI Rewrite Generator
  • Blog Post Generator
  • Meta Description Generator
  • AI Email Generator
  • Email Subject Line Generator
  • Instagram Bio Generator
  • AI Hashtag Generator

Resources

  • Claude Code MCP Servers
  • Claude Code Skills
  • n8n Hosting Comparison
  • OpenClaw Hosting Comparison
  • Claude Code Plugins
  • Claude Code Use Cases
  • Claude Cowork Use Cases
  • OpenClaw Use Cases
  • Changelogs

© 2026 Gradually AI. All rights reserved.

  • Blog
  • About Us
  • Legal Notice
  • Privacy Policy