Timeline to AGI.

When the labs building frontier AI expect it to reach human level, and just as important, what each step unlocks and what to do about it. Updated every time a new model moves the date.

What “AGI” means here

“When I say AGI, I mean a machine that can do the sorts of cognitive things that people can typically do, possibly more.”

Shane Legg, Co-founder & Chief AGI Scientist, Google DeepMind · Dwarkesh Patel interview (Oct 2023)

We anchor to Legg's definition because it's the most cited and the most testable: AGI is reached only when expert teams, with full access to the system, can no longer find a cognitive task where it falls below a typical human. Legg has put the odds at 50% by 2028 — an estimate he's held since his 2011 blog post.

[Chart: forecast ranges by lab and forecaster on a 2026–2032 timeline]

Peregrinations (our view) · 2028–2029
Google DeepMind
Shane Legg, Co-founder & Chief AGI Scientist, Google DeepMind · 2028
Demis Hassabis, CEO, Google DeepMind · 2029–2031
Anthropic
Dario Amodei, CEO, Anthropic · 2026–2030
Jack Clark, Co-founder, Anthropic · 2028
OpenAI
Sam Altman, CEO, OpenAI · 2027–2032

The measured curve · When

How fast is capability actually moving?


Each dot is a model. Left → right is when it shipped; bottom → top is how long a task it finishes unattended at 50% reliability. Color marks the lab. The solid line is the trend; the dashed line continues it at the measured doubling.

GPT-4 handled ~4-minute tasks in 2023. Three years on, Claude Opus 4.6 handles ~12 hr at 50% reliability, though only ~1.2 hr at the stricter 80% bar. The 50% horizon has been doubling about every 4.2 months.

~4.2 mo · 50% horizon doubling (METR, since 2023)
12 hr · Claude Opus 4.6 · 50% reliability (once)
1.2 hr · Claude Opus 4.6 · 80% reliability (dependably)

[Chart: task horizon (log scale; 1 min, 1 hr, work-day, work-week, work-month) against release year, 2019–2027. Labeled points: GPT-4 · 4 min, o3 · 2 hr, Claude Opus 4.6 · 12 hr. Legend: Anthropic, OpenAI, Google, Our projection · ~4.2-mo doubling.]

Log scale: each gridline is a 10× jump, so steady doubling shows up as a straight climb — the cleanest way to read the trend.
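The straight-line claim can be checked numerically. The sketch below is illustrative only: it starts from GPT-4's ~4-minute horizon, applies the quoted ~4.2-month doubling, and samples the horizon once a year. The sampling interval and variable names are assumptions made for the example, not part of METR's method.

```python
import math

DOUBLING_MONTHS = 4.2  # doubling time quoted in the text
START_MIN = 4.0        # GPT-4's ~4-minute horizon, used only as a starting value

# Horizon (in minutes) sampled every 12 months under steady doubling.
horizons = [START_MIN * 2 ** (m / DOUBLING_MONTHS) for m in range(0, 49, 12)]

# On a log10 axis, equal time steps produce equal vertical steps.
log_steps = [math.log10(b) - math.log10(a) for a, b in zip(horizons, horizons[1:])]
print([round(s, 3) for s in log_steps])

# Months needed to climb one gridline (a 10x jump) at this doubling time.
months_per_gridline = math.log2(10) * DOUBLING_MONTHS
print(f"one 10x gridline every {months_per_gridline:.1f} months")
```

Every entry in `log_steps` is the same, which is what a straight line on a log axis means.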

a full work-day · 8 hr: Reached (Claude Opus 4.6 clears it at 50% reliability)

a work-week · 40 hr: Sep 2026 (range Aug 2026 – Dec 2026)

a work-month · ≈167 work-hours: Jun 2027 (range Mar 2027 – Jan 2028)

Why does the dashed line climb so fast? It isn't drawn by hand; it continues the measured pace. The 50% horizon doubles about every 4.2 months (METR, since 2023); hold that and it compounds, about 5.7 doublings in two years, roughly 51×. Compounding, not optimism, is what lifts the line.
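The compounding arithmetic can be reproduced in a few lines. This is a minimal sketch, not the page's own model: the function name `growth_factor` is hypothetical, and the only input taken from the text is the ~4.2-month doubling time. Because 4.2 is itself a rounded figure, the result lands close to, rather than exactly on, the 51× quoted above.

```python
DOUBLING_MONTHS = 4.2  # METR 50% horizon doubling time, as quoted in the text (rounded)


def growth_factor(months: float, doubling_months: float = DOUBLING_MONTHS) -> float:
    """Multiplicative growth after `months` of steady doubling."""
    return 2.0 ** (months / doubling_months)


doublings_in_two_years = 24 / DOUBLING_MONTHS
print(f"doublings in 24 months: {doublings_in_two_years:.1f}")
print(f"growth factor over 24 months: {growth_factor(24):.1f}x")
```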

Extrapolated, the curve reaches a work-month of autonomous work around Jun 2027 (range Mar 2027–Jan 2028) at 50% reliability. 50% means “succeeds about half the time”; 80% is the dependable bar and lands later. The 50% path lands on our end-2028 AGI call from the evidence side, not the opinion side.
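To see where these crossing times come from, the extrapolation can be sketched as months elapsed from the Claude Opus 4.6 measurement. The helper `months_to_reach` is a hypothetical name, and the 80% column rests on an assumption the text does not state: that the 80% horizon doubles at the same pace as the 50% horizon.

```python
import math

DOUBLING_MONTHS = 4.2   # 50% horizon doubling time quoted in the text
HORIZON_50_HR = 12.0    # Claude Opus 4.6 at 50% reliability
HORIZON_80_HR = 1.2     # Claude Opus 4.6 at 80% reliability


def months_to_reach(target_hr: float, current_hr: float,
                    doubling_months: float = DOUBLING_MONTHS) -> float:
    """Months until a horizon of `current_hr` grows to `target_hr` at a fixed doubling time."""
    if target_hr <= current_hr:
        return 0.0  # bar already reached
    return math.log2(target_hr / current_hr) * doubling_months


BARS_HR = {"work-day": 8.0, "work-week": 40.0, "work-month": 167.0}

for name, hours in BARS_HR.items():
    at_50 = months_to_reach(hours, HORIZON_50_HR)
    # ASSUMPTION: the 80% horizon doubles at the same pace as the 50% horizon.
    at_80 = months_to_reach(hours, HORIZON_80_HR)
    print(f"{name:>10}: {at_50:5.1f} mo at 50%, {at_80:5.1f} mo at 80%")
```

On these inputs the work-week and work-month bars fall roughly 7 and 16 months after the anchor at 50% reliability, which lines up with Sep 2026 and Jun 2027 if the anchor sits in early 2026. Under the same-pace assumption, the tenfold gap between 12 hr and 1.2 hr puts the 80% path about 14 months behind.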

Data: every point is METR's measured 50% and 80% time horizon (METR-Horizon-v1.1). Source: METR. Trend & extrapolation are ours, on METR's data.

The fork · When, three ways

Same data. The rate is the whole argument.

The curve above shows how fast capability has grown. But when AI can do a work-month (≈167 work-hours) of expert work unattended depends entirely on what the rate does next. Below are three assumptions, anchored on the same measured point (Claude Opus 4.6 at ~12 hr), and how far apart they land.

[Chart: task horizon (log scale; work-day, work-week, work-month) against year, 2026–2029, starting from Today · 12 hr. Paths: Stalled · linear, Current trend, Recursive self-improvement.]

Log scale: the trend is a straight line, the stalled path bends down, the RSI path bends up, so the three rate regimes read at a glance.

Stalled · linear: Sep 2032 (off the chart). Compounding stops; today's absolute pace (~3.9 min/day) is just held flat.

Current trend: Jun 2027. METR's measured ~4.2-month doubling simply continues.

Recursive self-improvement: Jan 2027. The doubling time itself halves every ~12 months, an intelligence-explosion sketch.

Same measured data; three guesses at the rate. The date AI could sustain a work-month of expert work swings from ~Jun 2027 on trend to ~Sep 2032 if compounding stalls: more than five years apart, driven entirely by how you extrapolate. Most people extrapolate linearly and land late; capability has compounded, and compounding lands early. That gap, between the linear intuition and the exponential reality, is where the edge is. The RSI path is the tail: if models start accelerating their own progress, even the trend date is conservative.
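The three rate regimes can be written down as closed-form expressions for the time needed to go from 12 hr to a work-month. This is one plausible reading of the three descriptions, not the page's own code: "stalled" is taken as linear growth at the slope of the exponential at the anchor, and the RSI path as a doubling time that decays continuously, halving every 12 months. All function names are hypothetical.

```python
import math

H0_HR = 12.0            # anchor: Claude Opus 4.6 50% horizon, from the text
TARGET_HR = 167.0       # work-month bar, from the text
DOUBLING_MONTHS = 4.2   # measured doubling time, from the text
HALVING_MONTHS = 12.0   # RSI sketch: doubling time halves every ~12 months

doublings_needed = math.log2(TARGET_HR / H0_HR)


def months_on_trend() -> float:
    """Constant doubling time: each doubling costs the same number of months."""
    return doublings_needed * DOUBLING_MONTHS


def months_if_stalled() -> float:
    """Linear growth at today's absolute pace (slope of the exponential at the anchor)."""
    pace_hr_per_month = H0_HR * math.log(2) / DOUBLING_MONTHS
    return (TARGET_HR - H0_HR) / pace_hr_per_month


def months_with_rsi() -> float:
    """Doubling time T(t) = T0 * 2**(-t / HALVING_MONTHS).

    Doublings accumulated by time t are the integral of 1/T(s) ds, which equals
    (HALVING_MONTHS / (T0 * ln 2)) * (2**(t / HALVING_MONTHS) - 1); solve for t.
    """
    k = doublings_needed * DOUBLING_MONTHS * math.log(2) / HALVING_MONTHS
    return HALVING_MONTHS * math.log2(1.0 + k)


for label, months in [("stalled", months_if_stalled()),
                      ("trend", months_on_trend()),
                      ("RSI", months_with_rsi())]:
    print(f"{label:>8}: {months:5.1f} months from the anchor")
```

Under these assumptions the paths come out at roughly 78, 16 and 11 months from the anchor, which matches the spacing of Sep 2032, Jun 2027 and Jan 2027 if the anchor sits in early 2026. The stalled pace works out to about 1.98 hr per month, or roughly 3.9 min per day.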

Anchored on METR's measured 50% horizon for Claude Opus 4.6. The three paths are our illustrative rate models, what-ifs to show how much the date rides on the rate, not forecasts. These gains arrive in steps, one frontier launch at a time (see the launch cadence below).

What the dates actually buy you

A date is useless until you know what to do with it.

The curve says when. This says what changes when each bar falls, and where to stand before it does. Every rung pairs a date from the curve (measured for the first rung, extrapolated for the others) with our read on what it unlocks and how to position. The data are METR's; the extrapolation and the reads are ours.

1 · Reached
Hand a model a task that would take an expert a full day, and it finishes it — about half the time.
What it unlocks
The unit you delegate jumps from a question to a whole task. But at 50% reliability your job flips from doing the work to checking it, so the value migrates to whatever catches the other half — eval harnesses, diff review, sandboxes, replay.
How to position
Bet on the verification layer. Generation stopped being the bottleneck; trusting the output is the new one. Tooling priced on a human in every loop starts to look mispriced.
Bottleneck shifts → verification & trust
2 · ≈ Sep 2026 (range Aug 2026 – Dec 2026)
A model sustains a week of expert work in one push — multi-day projects, not single tasks.
What it unlocks
Now you delegate a project, not a task: an agent can hold a week-long goal, sequence its own sub-tasks, and recover from its own mistakes across days. The work that survives is framing the goal and judging the result — taste and specification, not execution.
How to position
The org chart becomes the product. Whoever turns one operator into a manager of agents takes the seat — and headcount-priced software wobbles hardest here, because one person now ships a team’s output.
Bottleneck shifts → orchestration & taste
3 · ≈ Jun 2027 (range Mar 2027 – Jan 2028) · AGI proxy
A model runs a month of expert work end to end — the length of a real job’s deliverable cycle. This is the bar Legg’s definition points at.
What it unlocks
The question stops being which tasks and becomes which jobs. Whole functions — a research desk, a junior dev team, a paralegal pool — can run as a service rather than a headcount.
How to position
When cognition is cheap, the constraint moves off cognition — onto what doesn’t scale with model quality: the compute and energy to run it, the trust and liability when it’s wrong, and proprietary data nobody can reproduce. Owning the scarce complement beats owning the model.
Bottleneck shifts → compute, energy & liability

Dates are when our extrapolation of METR's measured 50%-reliability curve crosses each bar (the band spans its fastest-to-slowest doubling). What each rung unlocks and how to position are our interpretive reads, with wider error bars than the dates, and they are the point of the whole page.

Delta log

Every major capability event, with how it moved our range, and why.

No change · medium · May 4, 2026
Clark publishes 60% RSI-by-2028 thread
Anthropic co-founder Jack Clark, after a multi-week internal-data review, publicly assigned a 60% probability to recursive self-improvement occurring before end of 2028. Pere's range — end of 2028 to end of 2030 — already absorbs Clark's call: he's a forcing function on the low side, not a new data point that moves the curve.
Source
Pulled forward · medium · Feb 18, 2026
Hassabis tightens to "within five years" at India Summit
Six weeks after a more conservative "five to 10 years" line at Davos, Hassabis pulled in to "AGI is on the horizon, maybe within the next five years" at the India AI Impact Summit. The shift narrows the public DeepMind position toward Legg's 2028 anchor and toward Pere's 2028–2030 range. We treat this as a meaningful pull-forward from the most cautious frontier-lab CEO.
Source

Every forecast, sourced

Each estimate above is a real, public claim. Here's the verbatim quote, the date, and the link.

Google DeepMind
Shane Legg, Co-founder & Chief AGI Scientist, Google DeepMind · Median 2028
“I think there's a 50% chance that we have AGI by 2028. Now, it's just a 50% chance.”
Dwarkesh Patel interview (Oct 2023) · Source
Demis Hassabis, CEO, Google DeepMind · 2029–2031
“Now in 2026, we're at another threshold moment where AGI is on the horizon, maybe within the next five years.”
India AI Impact Summit (Feb 2026) · Source
Anthropic
Dario Amodei, CEO, Anthropic · 2026–2030
“My basic prediction is that powerful AI could come as early as 2026, though there are also ways it could take much longer.”
Machines of Loving Grace (Oct 2024) · Source
Jack Clark, Co-founder, Anthropic · Median 2028
“I think there's a 60% chance that recursive self-improvement (RSI) will occur before the end of 2028.”
X thread (May 4 2026) · Source
OpenAI
Sam Altman, CEO, OpenAI · 2027–2032
“It is possible that we will have superintelligence in a few thousand days; it may take longer, but I'm confident we'll get there.”
The Intelligence Age (Sep 2024) · Source

Methodology

Every forecast presented is anchored on public, verified statements from leading AI research laboratories and industry figures. Verbatim citations, precise publication dates, and primary source links are documented for every entry.

The composite range represents a statistical consensus interval rather than a single target. Updates are applied systematically based on verified capability jumps, hardware milestones, and architectural breakthroughs.

