AI Timeline
What shipped, when, and why it mattered.
Major launches, model releases, and product announcements across the AI landscape — from ChatGPT going mainstream to the rise of autonomous agents. Filter by org or read it all.
56 events
Claude Design — visuals, prototypes, and decks from conversation
Anthropic launches Claude Design, an experimental product for creating visual work: prototypes, presentation decks, one-pagers, and UI mockups. Built on Opus 4.7. Users describe what they need, Claude builds a first version, then they refine through conversation or inline edits. Reads a team's design system and codebase for brand consistency. Exports to PDF, URL, PPTX, or Canva. Available to Pro, Max, Team, and Enterprise subscribers.
Claude Opus 4.7 — sharper vision, better coding, self-verification
Opus 4.7 upgrades the flagship model with improved agentic coding, multidisciplinary reasoning, scaled tool use, and computer use. New xhigh effort level gives finer latency vs. reasoning control. Maximum image resolution jumps from 1.15MP to 3.75MP (3.3× increase). Pricing unchanged at $5/$25 per million tokens. Available across Claude products, API, Bedrock, Vertex AI, and Microsoft Foundry.
Muse Spark Shopping — AI-native commerce across Meta apps
Meta embeds a shopping experience directly into Muse Spark, rolling out across WhatsApp, Instagram, Facebook, and Messenger. Users get outfit suggestions, room styling help, and gift recommendations in-conversation. First major commerce integration built on a foundation model rather than a separate product layer — a template for how AI interfaces could replace traditional e-commerce flows.
GPT-5.4-Cyber — security-focused model with tiered access
OpenAI releases GPT-5.4-Cyber to vetted researchers and security teams, with access tiers that loosen restrictions for verified practitioners doing defensive research. Designed for threat modeling, vulnerability analysis, and exploit documentation — with guardrails calibrated by verification level rather than a single policy. Released in direct response to Anthropic's Mythos Preview from Project Glasswing.
Claude for Word — native sidebar in Microsoft Word
Claude launches as a native sidebar add-in for Microsoft Word on Mac and Windows. Highlights passages, rewrites sections, and inserts edits as tracked changes using Word's existing review workflow. Completes Claude's integration across the full Office suite (Excel, PowerPoint, Word). Available to Team and Enterprise plans.
Ultraplan — cloud-powered planning for Claude Code
Claude Code gains Ultraplan: a cloud-powered planning mode that uses Claude on the web to generate comprehensive implementation plans before coding begins. Designed for complex, multi-file tasks where getting the architecture right up front saves hours of rework.
Advisor strategy — Opus intelligence at Sonnet prices
Anthropic introduces the advisor tool: pair a fast executor model (Sonnet or Haiku) with Opus as a strategic advisor that only gets called on hard decisions. Sonnet + Opus advisor improved SWE-bench Multilingual by 2.7 percentage points while cutting per-task cost by 11.9%. A new paradigm for cost-effective agent intelligence.
Ask Your Org — org-wide knowledge search across Slack, email, Drive
Anthropic launches Ask Your Org: a pre-configured Project that searches across connected company tools (Slack, Microsoft 365, Google Workspace, custom MCP connectors) and returns a single synthesized answer with citations. Permission-aware — users only see data they can already access. Available to Team and Enterprise plans after owner setup.
New admin controls — user groups, spend limits, Compliance API
Anthropic ships a batch of admin controls for Team and Enterprise: user groups with SCIM sync, role-based access defining which Claude features each group can use, per-user spend caps, managed Claude Code policies (tool/file/MCP permissions), and a new Compliance API for Enterprise giving programmatic access to usage data and selective deletion.
Claude Cowork generally available — enterprise-ready
Cowork goes GA on macOS and Windows with Analytics API access, OpenTelemetry monitoring, and role-based access controls for enterprise departments. The shift from collaborative experiment to production-grade team workspace.
Monitor tool — background streaming in Claude Code
Claude Code gains the Monitor tool: spawn a background process and stream its stdout into the conversation without blocking the thread. Enables patterns like "watch kubectl logs for errors and fix any crashes" — a step toward always-on agent awareness.
Muse Spark — Meta's first proprietary model
Meta launches Muse Spark, its first proprietary (non-open-source) model, developed by Meta Superintelligence Labs. Small and fast, competitive on reasoning and agentic tasks. Signals a strategic shift: Meta now has both open (Llama) and closed model lines.
Claude Managed Agents — autonomous agents via API
Anthropic launches Managed Agents in public beta: a fully managed harness for running Claude as an autonomous agent with secure sandboxing, built-in tools, and streaming. Create agents, configure containers, and run sessions entirely through the API. The biggest shift from Claude-as-assistant to Claude-as-worker.
Project Glasswing — defensive cybersecurity coalition
Anthropic announces Project Glasswing alongside AWS, Apple, Google, Microsoft, NVIDIA, and others to secure critical software infrastructure. Claude Mythos Preview — a specialized cybersecurity model — available as a gated research preview for defensive work.
AI Edge Eloquent — offline-first AI dictation for iOS
Google quietly ships AI Edge Eloquent, an on-device dictation app for iOS using Gemma-based speech recognition models. Works fully offline; optional Gemini cloud integration for post-processing. Signals Google's push into private, on-device AI — where inference stays on the hardware rather than hitting a remote API.
Gemma 4 — natively multimodal open model family
Google releases Gemma 4, an open model family from 2.3B to 31B parameters that is natively multimodal (text, image, video). The 31B Dense variant ranks #3 globally among open models. A major leap in what open-weights models can do.
MAI-Transcribe-1, MAI-Voice-1, MAI-Image-2 — three in-house models
Microsoft announces three new proprietary MAI foundation models available in Azure AI Foundry: MAI-Transcribe-1 (state-of-the-art multilingual speech recognition), MAI-Voice-1 (custom voice synthesis), and MAI-Image-2 (top Arena.ai leaderboard scores, 2× faster generation). First clear signal that Microsoft is building its own model stack alongside its OpenAI and Anthropic partnerships.
1M token context window — generally available
The 1M token context window is now GA for Claude Opus 4.6 and Sonnet 4.6 at standard pricing — no beta header required. Requests over 200k tokens work automatically. Also raised the media limit from 100 to 600 images or PDF pages per request.
Claude Partner Network — $100M enterprise adoption push
Anthropic launches the Claude Partner Network with $100M invested to accelerate enterprise adoption. Targets system integrators, consultants, and implementation partners helping organizations deploy Claude at scale.
Claude Sonnet 4.6 — frontier performance at everyday speed
Sonnet 4.6 launches as the balanced model for professional work at scale — improved agentic search, fewer tokens consumed, extended thinking support, and 1M token context window. Web search and code execution tools hit general availability on the same day.
Series G — $30 billion at $380B valuation
Anthropic raises $30B in Series G funding at a $380B post-money valuation, with $14B annual run-rate revenue. One of the most valuable private companies in history. Signals market conviction that frontier AI will be defining infrastructure.
Claude Opus 4.6 — flagship for long-horizon agents
Opus 4.6 launches for complex, long-horizon agentic tasks. Introduces adaptive thinking (replacing manual budget_tokens), the compaction API for effectively infinite conversations, and data residency controls. Fast mode — up to 2.5x faster — available in preview.
Claude Labs — experimental features platform
Anthropic launches Labs, a platform for experimental Claude features before they reach the main product. First move toward a formal beta program for early access to capabilities in development.
Claude for Healthcare & Life Sciences — HIPAA-ready
Anthropic launches HIPAA-ready infrastructure with clinical trial connectors for healthcare and life sciences. Opens Claude to regulated industries that previously had compliance blockers.
Copilot for M365 — broad enterprise rollout
Microsoft Copilot reaches general availability across M365 enterprise. AI embedded in Word, Excel, PowerPoint, Outlook, and Teams. Forces the question: if Copilot is good enough for document work, what does Claude add?
MCP donated to the Agentic AI Foundation
Anthropic donates the Model Context Protocol to the newly formed Agentic AI Foundation, cementing MCP as an open industry standard. Adopted by dozens of companies since its release, MCP becomes a neutral, community-governed protocol.
Claude.ai Cowork — shared workspaces for teams
Real-time collaborative sessions in Claude — multiple users working in the same conversation. Foundation for team-based AI workflows.
Claude Opus 4.5 — step-change in vision and coding
Opus 4.5 launches as the most capable Claude model yet — step-change improvements in vision, coding, and computer use at a more accessible price than previous Opus models. Best model for complex specialized tasks and professional software engineering.
Claude in Microsoft Foundry — Azure integration
Claude models come to Azure customers through Microsoft Foundry with Azure billing and OAuth authentication. Full Messages API access including extended thinking, prompt caching, PDF support, and Agent Skills.
ChatGPT canvas + memory by default
OpenAI ships long-term memory as a default ChatGPT feature, and launches canvas — a structured writing and code workspace. Accelerates the shift from chatbot to persistent AI workspace.
Agent Skills — Claude works with Office files natively
Agent Skills launch in beta: pre-built Skills for PowerPoint, Excel, Word, and PDF files. Also supports custom Skills where you package your own domain expertise. Claude can now read and write Office documents without manual parsing — a major unlock for enterprise workflows.
Claude Haiku 4.5 — fastest model with near-frontier intelligence
Haiku 4.5 launches as the fastest and most capable Haiku yet — near-frontier performance for real-time applications, high-volume processing, and cost-sensitive deployments where speed matters more than maximum intelligence.
Claude Sonnet 4.5 — built for complex agents and coding
Sonnet 4.5 launches with the highest intelligence of any Sonnet, built for complex agent workflows and coding tasks. Also ships: the memory tool (persistent context across conversations) and context editing for automatic conversation management.
Web fetch tool — Claude reads any webpage
Web fetch tool launches in beta, letting Claude retrieve full content from any web page or PDF by URL. Pairs with web search for end-to-end research tasks.
Claude Code — agentic development in the terminal
Claude Code launches as a terminal-native development experience: reads codebases, runs commands, writes and edits files, and navigates full projects. Goes further than autocomplete into agentic development.
Claude Opus 4 + Sonnet 4 — the Claude 4 family launches
Anthropic ships the first Claude 4 models: Opus 4 for frontier tasks and Sonnet 4 for everyday use, both with extended thinking. Also ships: Files API, Code Execution tool, and MCP connector in the API. A step-change in what Claude can do in production.
Web search — Claude accesses the live internet
Web search launches in the API, giving Claude access to up-to-date information. Closes the biggest gap between Claude and web-native AI products.
GPT-4o — natively multimodal with real-time voice
GPT-4o ships as a single model handling voice, image, and text without switching modes. Real-time voice with sub-second latency. Brings the conversational AI interaction model much closer to reality.
LLaMA 4 — frontier open-weights model
Meta releases LLaMA 4 with performance competitive with GPT-4-class models. Largest open-weights release to date. Reinforces that the gap between open-source and proprietary is closing faster than expected.
Claude Sonnet 3.7 — extended thinking in production
Claude Sonnet 3.7 ships as the first Claude model with extended thinking built in — a mode where Claude reasons step-by-step before answering. Near-instant responses or visible thinking, one model. Anthropic's first mainstream reasoning model.
Agentic AI enters mainstream product conversation
The term "AI agents" crosses from research into mainstream product discussion. Every major AI lab ships or announces agentic products. The question shifts from "can AI do this?" to "how do you orchestrate AI to do this reliably?"
Citations API — source attribution for document work
Claude gains the ability to cite its sources when answering from documents — pointing to the exact passage it drew from. Major unlock for trust in enterprise document workflows and RAG applications.
Gemini Advanced with Deep Research
Google ships Deep Research in Gemini Advanced — a mode that autonomously researches a topic across the web over several minutes and returns a structured report. First mainstream implementation of a multi-step research agent.
Model Context Protocol (MCP) — open standard for tool connections
Anthropic releases MCP as an open protocol for connecting AI models to external tools and data sources. Other companies adopt it quickly, making MCP a de-facto standard. The shift from proprietary to standardized AI integrations.
Batches, Token Counting, Prompt Caching — all go GA
Anthropic moves several API features to general availability: Message Batches API (50% cost reduction on batch jobs), Token Counting API, Prompt Caching (90% cost reduction), and PDF support. Also ships Go and Java SDKs.
Claude Haiku 3.5 — fast, affordable, upgraded
Claude Haiku 3.5 launches as a fast, cost-efficient model for real-time applications and high-volume tasks. Better than Haiku 3 at a similar price point.
Computer Use — Claude controls a desktop
Anthropic ships Computer Use in public beta: Claude can move a mouse, click, type, and navigate GUI applications. First mainstream API for AI-controlled computer interaction. Sets the foundation for desktop-level automation.
o1 — reasoning model with explicit chain-of-thought
OpenAI releases o1, a model that reasons explicitly before answering. Significantly outperforms GPT-4 on math and science benchmarks. Introduces "reasoning model" as a distinct category from "chat model."
Console Workspaces — project-level API management
Workspaces launch in the Developer Console: custom spend limits, grouped API keys, usage tracking by project, and user roles. First real admin layer for teams using the Claude API.
Prompt Caching — 90% cost reduction on repeated context
Prompt caching launches in beta: cache and reuse prompt content to reduce costs by up to 90% and latency by up to 80%. Major cost unlock for applications using the same system prompt or documents repeatedly.
Claude Sonnet 3.5 + Artifacts — a new interaction model
Claude Sonnet 3.5 launches with benchmark-leading coding performance. Artifacts ship alongside: a side-panel for code, documents, and interactive outputs. The shift from text responses to living, editable artifacts.
Claude 3 family — Haiku, Sonnet, Opus
Anthropic ships the Claude 3 model family: three tiers for speed vs. capability tradeoffs. Opus leads on benchmarks. Haiku enables real-time, cost-efficient use cases. Establishes the tiered naming convention still in use today.
Gemini 1.0 Ultra — Google's frontier model
Google rebrands Bard to Gemini and ships the Ultra tier. Natively multimodal from architecture rather than retrofit. Signals Google's intent to compete directly with GPT-4 and Claude at the frontier.
Claude 1 — Constitutional AI in production
Anthropic ships the first public Claude, trained using Constitutional AI — aligning model behavior using a set of principles rather than purely human feedback. First major alternative to GPT at capability scale.
GPT-4 — multimodal, significantly improved reasoning
GPT-4 launches with vision capabilities and measurably better reasoning than GPT-3.5. Triggers the "AI moment" for enterprise adoption.
ChatGPT — the moment AI went mainstream
ChatGPT launches and reaches 100 million users in two months — the fastest consumer product adoption in history. Not the most capable model at the time, but the one that made AI accessible to everyone. Everything since has been a response to this moment.
This timeline covers launches relevant to operators, founders, and teams building with AI — not an exhaustive record of every model release or research paper. Focus is on things that changed what was practically possible. Updated as things ship.