What triggered the current AI boom?

The trigger was ChatGPT in November 2022. The groundwork, however, was laid back in 2017 by the Transformer architecture, followed by GPT-2 (2019) and GPT-3 (2020). ChatGPT made the technology tangible for a mass audience for the first time, reaching 100 million users in two months.

What does the AI "intelligence explosion" mean?

It refers to the rapid pace at which AI models keep improving. On GPQA Diamond, a PhD-level test, the best models climbed from around 39% (GPT-4, 2023) to over 94% (2026). At the same time, context windows grew from a few thousand to millions of tokens, while prices per query fell sharply.

Are open models as good as proprietary ones?

The gap has narrowed sharply. With Llama 3.1 405B (2024) and DeepSeek-V3 (2024), open models came close to the frontier for the first time. By 2026, open MoE models like DeepSeek-V4-Pro (1.6 trillion parameters) and Kimi K2.6 reach trillion scale and trail the best proprietary systems by only a small margin on many tasks.

How fast do new AI models appear?

Ever faster. Almost three years passed between GPT-3 (2020) and GPT-4 (2023). By 2026 the leading labs often ship monthly; in April 2026 alone, several frontier models arrived from OpenAI, Anthropic, DeepSeek, Moonshot, and Alibaba.

Which modalities does the timeline cover?

The timeline covers text and language models, image generators, video and audio AI, as well as multimodal models and agents or developer tools. You can filter the entries by modality, license (open or proprietary), developer, and year.

Where do the numbers and data come from?

Model release dates, parameter counts, context windows, and benchmark scores come from a centrally maintained dataset based on official announcements, model cards, and research papers. Benchmark scores for older models are individually sourced. Every chart can be exported as an image and is available under CC BY 4.0.

The History of AI as a Timeline

From the 2012 deep-learning breakthrough through the generative revolution to well-grounded forecasts for 2040. This interactive timeline maps the most important AI models and tools, with charts on the intelligence explosion.

The intelligence explosion

On demanding benchmarks, AI models jumped from beginner to expert level in three years. GPQA Diamond is a PhD-level test; SWE-bench Verified measures real coding tasks.

GPQA Diamond over time

OpenAI

Anthropic

Google

DeepSeek

Frontier over time

Source: vendor benchmarks and papers, single attempt

CC BY 4.0

gradually.ai

SWE-bench Verified over time

Anthropic

OpenAI

Google

DeepSeek

Frontier over time

Source: vendor benchmarks and papers, single attempt

CC BY 4.0

gradually.ai

Bigger, longer, then more efficient

First, models grew by orders of magnitude in parameters and context length. Today, parameter counts are often no longer disclosed, and the contest shifts to price and performance.

Parameter growth (log)

OpenAI

Google

NVIDIA

Meta

BigScience

TII

Mistral AI

xAI

01.AI

Anthropic

Cohere

Databricks

Microsoft

DeepSeek

Alibaba

Moonshot AI

Frontier over time

Source: gradually.ai LLM database (incl. estimates)

CC BY 4.0

gradually.ai

Context window explosion (log)

OpenAI

Anthropic

Google

Meta

Frontier over time

Source: vendor documentation

CC BY 4.0

gradually.ai

Price vs. performance (current models)

Ideal: strong + cheap

Anthropic

OpenAI

Google

DeepSeek

Moonshot AI

Efficiency frontier (best price-performance)

Source: gradually.ai LLM database

CC BY 4.0

gradually.ai

The pace accelerates

Each cell counts this timeline's models and tools per month. Yearly leaps have become monthly releases.

Releases: lowhigh(max 5)

Source: gradually.ai AI timeline

CC BY 4.0

gradually.ai

What has changed

Four dimensions compared, from 2022 to 2026.

Text

20224,000 tokens

2026over 1M tokens

Image

2022512px, no text

20262K, readable text

Video

20224 seconds, silent

2026minutes, with audio

Audio

2022robotic

2026real-time conversation

Who leads the market

Chatbot market share by web traffic

SimilarWeb (ppc.land), 05/2026

CC BY 4.0

gradually.ai

AI on the world map

Where AI models are built, how heavily AI is used, and where the most valuable AI companies sit, on a rotatable globe.

AI models per country

Loading globe…

2 models95 models

Source: gradually.ai LLM database (tracked models by developer HQ)

CC BY 4.0

gradually.ai

Where the curve points

Past trends can be extrapolated, but nothing about that is certain. Two robust data series and the wide range of serious expert forecasts, labeled honestly for what they are.

AI task horizon: from seconds to hours

measured (METR)projection (doubling every 4 to 7 months)

Measured through late 2025. The task horizon doubles roughly every seven months long-run, lately every four. METR expects month-long projects "by the end of the decade" (range roughly 2027 to 2031). Logarithmic axis; the projection is not a guarantee.

METR (2025-2026), own projection

CC BY 4.0

gradually.ai

When will general AI arrive? Estimates diverge widely

ScenarioCommunity/marketsModel estimateResearcher survey

AI 2027

Scenario; authors' median now ~2030

2027

Metaculus

First general AI (community median)

2033

Epoch AI

Transformative AI, "Direct Approach" (plausibly to 2076)

2033

Cotra (Bio-Anchors)

Transformative AI (revised forward from 2050)

2040

Grace et al. 2024

HLMI, 50% (10% by 2027); n = 2,778 researchers

2047

20302035204020452050

These estimates measure different things (scenario, market, model, survey), which is why they span ~2027 to 2047. The same researcher survey puts full automation of all jobs at around 2116. Forecasts are not guarantees.

Grace et al. 2024, Metaculus, Epoch AI, Cotra

CC BY 4.0

gradually.ai

The complete timeline

Curated milestones from the 2012 deep-learning breakthrough to well-grounded forecasts for 2040, filterable by modality, license, developer, and year.

Showing 116 of 116 entries

Modality

License & type

Jan 2040Ajeya Cotra

Forecast

Bio-anchors: transformative AI around 2040

Ajeya Cotra's "biological anchors" place transformative AI at a median of around 2040, revised forward from an original 2050. One of several model-based estimates with wide spread.

MultimodalMilestone

The History of AI as a Timeline

The intelligence explosion

Bigger, longer, then more efficient

The pace accelerates

What has changed

Who leads the market

Chatbot market share by web traffic

AI on the world map

Where the curve points

The complete timeline

Bio-anchors: transformative AI around 2040

Markets expect the first general AI

Model estimate: transformative AI

Training runs reach 2·10²⁹ FLOP

Agents handle month-long projects

Scenario: a "superhuman coder"

Claude Fable 5 and Mythos 5: the Mythos class

MiniMax M3: 1 million tokens from Shanghai

Mistral Medium 3.5: tuned for coding

Kimi K2.7 Code: an open coding model

GPT-5.6: Sol, Terra, and Luna

Claude Opus 4.8: dynamic workflows

Gemini 3.5 Flash: a fast all-rounder

Claude Opus 4.7: adaptive thinking and task budgets

GPT-5.5: agentic workflows over hours

DeepSeek-V4-Pro: 1.6 trillion parameters, open

Kimi K2.6: an open trillion-scale model from China

Qwen 3.6 Max: Alibaba's trillion-scale MoE

GPT-5.4: OpenAI keeps the pace

GPT-5.3-Codex: coding and reasoning unified

Claude 4.6: 1 million tokens and agent teams

Gemini 3.1 Pro: a double reasoning leap

Gemini 3 Pro: Google's next big leap

GPT-5.2: faster answers on the GPT-5 base

DeepSeek-V3.2: cheap open frontier performance

Mistral Large 3: Europe's flagship model

Claude Opus 4.5: new flagship

Claude Sonnet 4.5: hours-long agent runs

Sora 2 and its own social app

GPT-5: one model for everything

Nano Banana: image editing by language

Grok 4: xAI at the benchmark frontier

Claude Opus 4 and Sonnet 4: agentic coding

Google Veo 3: video with synchronized audio

Llama 4: Meta's mixture-of-experts generation

OpenAI o3 and o4-mini: reasoning with tools

ChatGPT image generation: the "Ghibli" moment

Gemini 2.5 Pro: Google takes the lead

Claude 3.7 Sonnet: first hybrid reasoning model

Claude Code: agentic coding in the terminal

Frequently asked questions about the history of AI

What triggered the current AI boom?

What does the AI "intelligence explosion" mean?

Are open models as good as proprietary ones?

How fast do new AI models appear?

Which modalities does the timeline cover?

Where do the numbers and data come from?

Prehistory & forecasts

Initial release

The History of AI as a Timeline

The intelligence explosion

Bigger, longer, then more efficient

The pace accelerates

What has changed

Who leads the market

Chatbot market share by web traffic

AI on the world map

Where the curve points

The complete timeline

Bio-anchors: transformative AI around 2040

Markets expect the first general AI

Model estimate: transformative AI

Training runs reach 2·10²⁹ FLOP

Agents handle month-long projects

Scenario: a "superhuman coder"

Claude Fable 5 and Mythos 5: the Mythos class

MiniMax M3: 1 million tokens from Shanghai

Mistral Medium 3.5: tuned for coding

Kimi K2.7 Code: an open coding model

GPT-5.6: Sol, Terra, and Luna