YOU ARE VIEWING ONE ITEM FROM THE AICRIER FEED

AI task horizons double every 7 months, METR finds

AICrier tracks AI developer news across Product Hunt, GitHub, Hacker News, YouTube, X, arXiv, and more. This page keeps the article you opened front and center while giving you a path into the live feed.

// WHAT AICRIER DOES

7+

TRACKED FEEDS

24/7

SCRAPED FEED

Short summaries, external links, screenshots, relevance scoring, tags, and featured picks for AI builders.

AI task horizons double every 7 months, METR finds
OPEN LINK ↗
// 74d agoRESEARCH PAPER

AI task horizons double every 7 months, METR finds

METR's research shows frontier AI agents have been doubling the length of tasks they can complete autonomously every 7 months since 2019, with recent acceleration to a 4-month doubling rate. At this trajectory, agents handling month-long software tasks could arrive within 2-4 years.

// ANALYSIS

This is the Moore's Law moment for AI agency — a clean exponential curve that makes vague AI hype claims concrete and measurable.

  • METR tested frontier models across ~230 tasks, finding R²=0.83 correlation between task length and agent success — unusually tight for AI benchmarks
  • Current frontier models (e.g. Claude 3.7 Sonnet) succeed on tasks taking humans a few minutes but fail 90%+ of the time on 4-hour tasks — the "why isn't AI replacing me yet" gap explained
  • 7-month doubling since 2019, now accelerating to 4 months, suggests the 2026-2027 window is when multi-day autonomous task completion becomes routine
  • The self-referential implication is stark: if agents can automate AI research, the doubling rate itself could accelerate — METR explicitly flags this flywheel risk
  • Benchmark R² of 0.83 is high but not predictive of discontinuous jumps; a single architectural breakthrough could shatter the curve in either direction
// TAGS
llmagentbenchmarkresearch

DISCOVERED

74d ago

2026-03-14

PUBLISHED

78d ago

2026-03-10

RELEVANCE

8/ 10

AUTHOR

EchoOfOppenheimer