AI Timeline
What shipped, when, and why it mattered.
Major launches, model releases, and product announcements across the AI landscape — from ChatGPT going mainstream to the rise of autonomous agents. Filter by org or read it all.
94 events
Claude Code v2.1.149 — /usage cost breakdown, enterprise MCP setting
Claude Code v2.1.149 ships two practitioner-relevant features: /usage, a new command that shows per-category token cost breakdowns mid-session (context vs. tools vs. output vs. cache); and allowAllClaudeAiMcps, an enterprise setting that blanket-approves all Claude.ai cloud MCP connectors without per-server admin review. Also in the v2.1.147-v2.1.152 range: /simplify renamed to /code-review, /code-review --fix auto-applies findings, Auto mode runs without a per-session consent prompt, and VS Code 1.121 adds remote agent sessions that survive laptop disconnect.
Project Glasswing: 10,000 vulnerabilities found in first month
Anthropic publishes the first update on Project Glasswing — its controlled-access cybersecurity initiative using the unreleased Claude Mythos Preview model. In the first month, Glasswing partners discovered 10,000+ software vulnerabilities. The open-source scan alone (1,000+ widely used projects) flagged 6,202 high/critical vulnerabilities with a 90.6% confirmed true-positive rate. Partners include AWS, Apple, Cisco, CrowdStrike, Google, JPMorganChase, Microsoft, NVIDIA, Palo Alto Networks, and now Verizon.
XAI acquires Cursor — $60B valuation, Composer 2.5
Elon Musk's XAI (backed by SpaceX compute infrastructure) acquires Cursor at roughly $60 billion with a $10 billion opt-out clause. The compute access enables Composer 2.5, a new coding model that is significantly faster and cheaper than equivalents on Claude or Codex — early reports put it at near-frontier quality for front-end work at a fraction of the token cost. Cursor announces intent to become a full 'coding and knowledge work' platform, adding automations and a marketplace alongside its existing in-app browser.
Claude Compliance API + 28 enterprise security integrations
Anthropic ships the Claude Compliance API and 28 launch-partner integrations across DLP (Microsoft Purview, Nightfall, Forcepoint), SIEM (Splunk, Securonix, Sumo Logic), CASB (Netskope, Zscaler), identity (Okta, CrowdStrike Falcon), eDiscovery (Relativity, Logikcull), and AI security posture (Wiz, Orca, Lacework). Enterprise admins get programmatic access to all Claude usage data — request-level prompts and responses — for compliance, eDiscovery, and security monitoring. The Compliance API and Security Ingestion API are distinct: compliance routes data to legal/HR, security routes alerts to SOC tooling.
Codex adds /goal mode and plugin sharing
OpenAI ships two meaningful Codex updates: /goal mode, which lets agents run autonomously for hours (or more than a day) toward a stated high-level objective rather than stopping after each command; and plugin sharing, which lets team members publish workflow plugins to the entire workspace. Design mode (screenshot + annotation for UI iteration) also ships. Together these move Codex significantly closer to a unified super-app that handles both coding and long-horizon knowledge work.
Google I/O 2026 — Gemini Spark preview, Gemini 3.5 Flash
Google I/O 2026's AI announcements: Gemini Spark (leaked/previewed) — a new agentic surface inside the Gemini app with virtual computers, long agent trajectories, canvas, deep research, and connections across Google Workspace. Gemini 3.5 Flash releases but falls short of frontier expectations — widely noted as faster but not competitive with GPT-5.5 or Claude Opus 4.7 on complex reasoning. Gemini Omni video model also announced but trails Sora and Kling. Google's main challenge: no clear unified super-app strategy (Gemini vs. AI Studio vs. Project IDX).
Andrej Karpathy joins Anthropic
Karpathy — founding OpenAI member, former Tesla Autopilot lead, and the foremost AI educator on the internet — announces he has joined Anthropic to get back to R&D. The hire is widely read as a significant talent signal: CTOs of publicly traded companies and top researchers across the industry have been choosing individual contributor roles at Anthropic over running organizations. Karpathy's publicly stated focus: the next few years of LLM development will be "especially formative."
KPMG global alliance — 276,000-person workforce deployment
KPMG signs a global alliance with Anthropic to deploy Claude across its 276,000-person workforce and embed Claude Cowork and Managed Agents into KPMG's Digital Gateway platform. Launch use cases: tax, legal, private equity. KPMG also announces KPMG Blaze — a product built on Claude Code for modernizing legacy IT systems inside private equity portfolio companies. Second Big Four firm to go all-in on Claude in one month, following PwC's announcement on May 14.
claude agents — parallel multi-agent terminal mode in Claude Code
Anthropic ships `claude agents`, a new terminal mode that lets users run multiple independent agent tasks simultaneously. Each task runs as its own agent; keyboard navigation moves between them. Distinct from the desktop parallel sessions sidebar — this is the headless/terminal-native equivalent. Enables parallel research, builds, and pipelines without GUI.
Anthropic acquires Stainless — SDK and MCP generation toolchain
Anthropic acquires Stainless for $300M+. Stainless automates generation of SDKs, CLIs, and MCP servers from API specs, and previously generated all official Anthropic, OpenAI, Google, Cloudflare, and Meta SDKs. Bringing Stainless in-house means Anthropic now controls the SDK pipeline and MCP server generation toolchain. All hosted Stainless products are being wound down; existing customers keep their generated SDKs.
Claude for Small Business — pre-built connectors and workflows
Anthropic launches Claude for Small Business: a bundle of pre-built connectors and ready-to-run workflows targeting companies without a dedicated IT or AI team. Seven integrations at launch: QuickBooks, PayPal, HubSpot, Canva, DocuSign, Google Workspace, Microsoft 365. Claude can read live data from connected tools and perform tasks without manual copy-paste. Available on Claude Pro and higher.
OpenAI launches The Deployment Company — $10B FDE venture
OpenAI launches a majority-owned deployment company with McKinsey, Bain, and Capgemini, funded at $10B+. Simultaneously acquires Tomoro, a 150-person forward deployed engineering firm, to staff it immediately. The Deployment Company puts OpenAI engineers inside enterprise clients to build production AI systems against their actual infrastructure — a direct copy of Palantir's FDE model. Launched within days of Anthropic's $1.5B JV with Blackstone and Goldman Sachs.
Android Show 2026 — Gemini Intelligence across Android
Google's Android Show 2026 centers on Gemini Intelligence: on-device AI that automates complex tasks, summarizes web content, fills forms, and includes Rambler (polishes voice messages or builds custom widgets from natural language). New Googlebooks AI-first laptops announced. Gemini comes to Chrome. Rolling out to Samsung and Pixel phones first, then broader Android. Signals the shift from Gemini-as-chatbot to Gemini-as-OS-layer.
Claude Platform on AWS — generally available
Anthropic's native Claude Platform experience arrives on AWS as a GA service — the first cloud provider to offer it. AWS customers get the full Claude API (Messages, Files, Batches, Managed Agents, Agent Skills, code execution, web tools) through native AWS endpoints, unified with existing AWS billing, authentication, and security controls. No separate Anthropic contract needed. Anthropic simultaneously commits $100B+ to AWS over ten years.
Managed Agents: Dreaming, Outcomes, multi-agent orchestration
Anthropic ships three new capabilities for Managed Agents. Dreaming: agents review their own past sessions, extract patterns, and self-improve over time. Outcomes: a separate grading agent scores completed tasks and re-runs them until they hit quality thresholds — lifted document generation quality 10.1% on benchmarks. Multi-agent orchestration: a lead agent breaks jobs into pieces and delegates each to a specialist with its own model, prompt, and tools running in parallel on a shared filesystem.
GPT-5.5 Instant — new default ChatGPT model
OpenAI ships GPT-5.5 Instant as the default model for all ChatGPT users, replacing GPT-5.3 Instant. Produces 52.5% fewer hallucinated claims on high-stakes prompts (medicine, law, finance). Can search past conversations, files, and Gmail for personalized answers. GPT-5.3 Instant remains available to paid users for three months before retirement.
Claude finance agents — 10 reference agents for financial services
Anthropic ships 10 ready-made agent templates aimed at financial services workflows: Pitch builder, Meeting preparer, Earnings reviewer, Model builder, Market researcher, Valuation reviewer, General ledger reconciler, Month-end closer, Statement auditor, and KYC screener. Each is a reference architecture bundling skills (task instructions + domain knowledge), connectors (governed access to data), and subagents. Available as plugins for Claude Cowork and Claude Code, and as cookbook snippets for Claude Managed Agents.
Enterprise AI services company — Anthropic + Blackstone + Goldman
Anthropic, Blackstone, Hellman & Friedman, and Goldman Sachs announce a new AI services company targeting mid-sized businesses. The joint venture deploys Claude into core operations across sectors — the first major move by Anthropic to own the services layer, not just the model.
Enterprise AI services JV with Blackstone, Goldman Sachs, and Hellman & Friedman
Anthropic announces a new joint venture with Blackstone, Goldman Sachs, and Hellman & Friedman to build an enterprise AI services company. The venture will help large companies integrate Claude into core business operations — sales, finance, ops, customer support — with funded engagement teams and prebuilt playbooks. Signals Anthropic going further down the services + integration path, not just the model + API path.
Microsoft Agent 365 and Microsoft 365 E7 — GA
Microsoft ships Agent 365 ($15/user/mo) and the Microsoft 365 E7 "Frontier Suite" ($99/user/mo, bundling M365 E5 + Copilot + Agent 365). Agent 365 is a control plane that discovers, governs, and secures AI agents across an enterprise — Microsoft's own Copilot agents, third-party SaaS agents, AWS Bedrock and Google Cloud agents, plus local agents on Windows endpoints including Claude Code, GitHub Copilot CLI, and OpenClaw. Aimed at IT/security teams managing "agent sprawl."
$700 billion — Big Tech AI infrastructure spending in 2026
Microsoft, Google, Meta, Amazon, and others are on track to collectively spend ~$700 billion on AI infrastructure in 2026, nearly double 2025 levels. Meta alone guided $115–135B in capex. No clear ceiling: every major hyperscaler is accelerating, treating AI compute as the defining infrastructure race of the decade.
ChatGPT Advanced Account Security
OpenAI rolls out Advanced Account Security for ChatGPT: phishing-resistant sign-in, tighter recovery controls, shorter sessions, login alerts, and automatic exclusion from training. Required by June 1, 2026 for individuals using OpenAI's most cyber-capable models under its Trusted Access for Cyber program. Tightens the consumer surface that has lagged enterprise IDP integrations on most LLM products.
Claude Security — public beta for Enterprise
Anthropic moves Claude Security from research preview into public beta for Claude Enterprise. Built on Opus 4.7. Scans repositories, traces data flows across files and modules, and produces vulnerability findings with proposed patches you review and apply. Three scan modes (scheduled, targeted, ad-hoc); findings export to Slack, Jira, CSV, or Markdown. Started life as Claude Code Security in February. Team and Max access on the roadmap.
Claude API skill expands to JetBrains, Warp, CodeRabbit, and Resolve AI
The claude-api skill — an open-source bundle of current API patterns, model parameters, prompt caching strategies, and SDK updates — now ships inside JetBrains IDEs (with Junie), Warp terminal, CodeRabbit PR review, and Resolve AI. Originally released in Claude Code in March, the skill keeps developer tools current with new model launches like Opus 4.7 without manual doc lookups. Reduces stale-API errors in production code and surfaces guided migrations inline.
Claude for Creative Work — connectors for Adobe, Blender, Ableton, and more
Anthropic launches nine connectors for professional creative software: Adobe Creative Cloud, Blender, Ableton, Autodesk Fusion, Affinity by Canva, SketchUp, Resolume, and Splice. Claude can now tutor, extend via scripting, and bridge across creative pipelines. Three university partnerships (RISD, Ringling, Goldsmiths) anchor an education push. Distinct from Claude Design (April 17), which generates visuals on its own.
Microsoft ends OpenAI exclusivity — opens to Google Cloud and AWS
Microsoft and OpenAI renegotiate their partnership: OpenAI can now license its models across multiple clouds including Google Cloud and Amazon Web Services. Microsoft loses exclusive IP access in exchange for a capped revenue share through 2030 and a nonexclusive license to OpenAI IP through 2032. Removes a structural barrier that had kept GPT models off competing cloud platforms. Amazon confirms it will make OpenAI models available on Bedrock shortly.
Agent memory — persistent state for Managed Agents
Anthropic ships agent memory in public beta under the Managed Agents API. Agents can now store and retrieve facts, context, and past decisions across sessions — without the developer managing a custom memory layer. Completes the foundational stack for stateful production agents.
GPT-5.5 — next step toward an AI super-app
OpenAI releases GPT-5.5 and GPT-5.5 Pro, framing it as a step toward a unified AI 'super-app.' Available via API (as chat-latest) from April 24. Positions ChatGPT closer to a full-featured work assistant rather than a pure chat interface.
GPT-5.5 — unified multimodal model, 84.9% on GDPval, 1M context
OpenAI ships GPT-5.5 and GPT-5.5 Pro: a single unified architecture handling text, images, audio, and video end-to-end. Co-designed with NVIDIA GB200/GB300 rack systems to match GPT-5.4 latency despite significant capability gains. Scores 84.9% on GDPval (44-occupation knowledge work benchmark), 78.7% on OSWorld-Verified (real computer operation), and 90.1% on BrowseComp. API access followed on April 24. Supports 1M token context in the API.
Agentic web standard — open protocol for AI agents to transact
Microsoft, Google, and Shopify co-publish an open standard for AI agents to discover products and transact on users' behalf, backed by Visa, Mastercard, and 20+ others. First shared infrastructure spec for agentic commerce — defining how agents connect to merchants, authenticate, and complete purchases. Sets the ground rules for a future where AI assistants shop independently.
Google-Apple partnership — Gemini powers next-gen Siri
Google confirmed it is Apple's preferred cloud AI partner: Gemini will power the context-aware capabilities in the next major Siri upgrade. Gives Google model reach into every iPhone without a consumer product of its own — and signals that Apple Intelligence's real bet is on partnered foundation models, not Apple-built ones.
Copilot agentic capabilities in Word, Excel, PowerPoint — GA
Microsoft ships agentic Copilot across all three core Office apps at GA. Multi-step, in-app actions now work natively: Word drafts and restructures documents in one flow; Excel builds analysis, formulas, and charts; PowerPoint updates existing decks with new data while respecting company templates. Takes Copilot from text suggestion to document-level action for all M365 enterprise customers.
Gemini Enterprise Agent Platform — GA at Google Cloud Next
Google ships the Gemini Enterprise Agent Platform at Google Cloud Next '26: a unified product to build, deploy, govern, and monitor AI agents at scale. Includes an Agent Designer UI, activity inbox, and pre-built agents from Salesforce, ServiceNow, Workday, Adobe, and Atlassian. Paired with a $750M partner fund to accelerate enterprise adoption.
ChatGPT Images 2.0 — reasoning before drawing
OpenAI ships gpt-image-2 with two modes: Instant (fast generation) and Thinking (browses for references, reasons through the prompt, self-verifies before rendering). Significantly better at dense text, precise object placement, and multi-language images. Available in ChatGPT and via API. Thinking mode closes a longstanding gap in instruction-following quality for complex visual tasks.
Codex expanded — agentic workspace with browsing, computer use, and memory
OpenAI expands Codex from code tool to full agentic workspace: computer use, in-app browsing, image generation via gpt-image-2, memory, and plugins ship in the desktop app. Also launches Codex Labs — a developer training and certification program with GSI partnerships. Signals Codex repositioning as a general agentic surface, not just a coding assistant. Over 4 million weekly developer users.
Claude Design — visuals, prototypes, and decks from conversation
Anthropic launches Claude Design, an experimental product for creating visual work: prototypes, presentation decks, one-pagers, and UI mockups. Built on Opus 4.7. Users describe what they need, Claude builds a first version, then they refine through conversation or inline edits. Reads a team's design system and codebase for brand consistency. Exports to PDF, URL, PPTX, or Canva. Available to Pro, Max, Team, and Enterprise subscribers.
Claude Opus 4.7 — sharper vision, better coding, self-verification
Opus 4.7 upgrades the flagship model with improved agentic coding, multidisciplinary reasoning, scaled tool use, and computer use. New xhigh effort level gives finer latency vs. reasoning control. Maximum image resolution jumps from 1.15MP to 3.75MP (3.3× increase). Pricing unchanged at $5/$25 per million tokens. Available across Claude products, API, Bedrock, Vertex AI, and Microsoft Foundry.
Muse Spark Shopping — AI-native commerce across Meta apps
Meta embeds a shopping experience directly into Muse Spark, rolling out across WhatsApp, Instagram, Facebook, and Messenger. Users get outfit suggestions, room styling help, and gift recommendations in-conversation. First major commerce integration built on a foundation model rather than a separate product layer — a template for how AI interfaces could replace traditional e-commerce flows.
Agents SDK — native sandboxing and production harness
OpenAI ships a major Agents SDK update: native sandbox execution isolates agents in scoped environments (only the files, tools, and code each task needs), and a model-native harness optimized for how GPT-5.4 performs on long-horizon, multi-tool work. A Manifest abstraction standardizes workspace description from local prototype to production, with support for S3, GCS, Azure Blob, and Cloudflare R2. Python first; TypeScript planned.
GPT-Rosalind — specialized AI for life sciences research
OpenAI launches GPT-Rosalind, a frontier reasoning model fine-tuned for life sciences: genomics, protein engineering, chemistry, and drug discovery. Named after Rosalind Franklin, the model outperforms GPT-5.4 on six of eleven LABBench2 tasks and leads on BixBench bioinformatics benchmarks. Supports evidence synthesis, hypothesis generation, and experimental planning. Available as a restricted research preview to qualified Enterprise customers; launching with Amgen, Moderna, the Allen Institute, and Thermo Fisher Scientific.
Claude Code desktop redesign — parallel sessions, integrated editor
Claude Code's desktop app ships a major redesign: a sessions sidebar for running multiple tasks in parallel, an integrated terminal and file editor with faster diffs, expanded preview pane, side chat (⌘+;) for asking questions without interrupting the main session, drag-and-drop workspace, and SSH support on Mac. The shift from a single-task CLI to a multi-session coding environment.
GPT-5.4-Cyber — security-focused model with tiered access
OpenAI releases GPT-5.4-Cyber to vetted researchers and security teams, with access tiers that loosen restrictions for verified practitioners doing defensive research. Designed for threat modeling, vulnerability analysis, and exploit documentation — with guardrails calibrated by verification level rather than a single policy. Released in direct response to Anthropic's Mythos Preview from Project Glasswing.
Ultraplan — cloud-powered planning for Claude Code
Claude Code gains Ultraplan: a cloud-powered planning mode that uses Claude on the web to generate comprehensive implementation plans before coding begins. Designed for complex, multi-file tasks where getting the architecture right up front saves hours of rework.
Claude for Word — native sidebar in Microsoft Word
Claude launches as a native sidebar add-in for Microsoft Word on Mac and Windows. Highlights passages, rewrites sections, and inserts edits as tracked changes using Word's existing review workflow. Completes Claude's integration across the full Office suite (Excel, PowerPoint, Word). Available to Team and Enterprise plans.
Monitor tool — background streaming in Claude Code
Claude Code gains the Monitor tool: spawn a background process and stream its stdout into the conversation without blocking the thread. Enables patterns like "watch kubectl logs for errors and fix any crashes" — a step toward always-on agent awareness.
New admin controls — user groups, spend limits, Compliance API
Anthropic ships a batch of admin controls for Team and Enterprise: user groups with SCIM sync, role-based access defining which Claude features each group can use, per-user spend caps, managed Claude Code policies (tool/file/MCP permissions), and a new Compliance API for Enterprise giving programmatic access to usage data and selective deletion.
Ask Your Org — org-wide knowledge search across Slack, email, Drive
Anthropic launches Ask Your Org: a pre-configured Project that searches across connected company tools (Slack, Microsoft 365, Google Workspace, custom MCP connectors) and returns a single synthesized answer with citations. Permission-aware — users only see data they can already access. Available to Team and Enterprise plans after owner setup.
Advisor strategy — Opus intelligence at Sonnet prices
Anthropic introduces the advisor tool: pair a fast executor model (Sonnet or Haiku) with Opus as a strategic advisor that only gets called on hard decisions. Sonnet + Opus advisor improved SWE-bench Multilingual by 2.7 percentage points while cutting per-task cost by 11.9%. A new paradigm for cost-effective agent intelligence.
Claude Cowork generally available — enterprise-ready
Cowork goes GA on macOS and Windows with Analytics API access, OpenTelemetry monitoring, and role-based access controls for enterprise departments. The shift from collaborative experiment to production-grade team workspace.
Claude Managed Agents — autonomous agents via API
Anthropic launches Managed Agents in public beta: a fully managed harness for running Claude as an autonomous agent with secure sandboxing, built-in tools, and streaming. Create agents, configure containers, and run sessions entirely through the API. The biggest shift from Claude-as-assistant to Claude-as-worker.
Muse Spark — Meta's first proprietary model
Meta launches Muse Spark, its first proprietary (non-open-source) model, developed by Meta Superintelligence Labs. Small and fast, competitive on reasoning and agentic tasks. Signals a strategic shift: Meta now has both open (Llama) and closed model lines.
AI Edge Eloquent — offline-first AI dictation for iOS
Google quietly ships AI Edge Eloquent, an on-device dictation app for iOS using Gemma-based speech recognition models. Works fully offline; optional Gemini cloud integration for post-processing. Signals Google's push into private, on-device AI — where inference stays on the hardware rather than hitting a remote API.
Project Glasswing — defensive cybersecurity coalition
Anthropic announces Project Glasswing alongside AWS, Apple, Google, Microsoft, NVIDIA, and others to secure critical software infrastructure. Claude Mythos Preview — a specialized cybersecurity model — available as a gated research preview for defensive work.
MAI-Transcribe-1, MAI-Voice-1, MAI-Image-2 — three in-house models
Microsoft announces three new proprietary MAI foundation models available in Azure AI Foundry: MAI-Transcribe-1 (state-of-the-art multilingual speech recognition), MAI-Voice-1 (custom voice synthesis), and MAI-Image-2 (top Arena.ai leaderboard scores, 2× faster generation). First clear signal that Microsoft is building its own model stack alongside its OpenAI and Anthropic partnerships.
Gemma 4 — natively multimodal open model family
Google releases Gemma 4, an open model family from 2.3B to 31B parameters that is natively multimodal (text, image, video). The 31B Dense variant ranks #3 globally among open models. A major leap in what open-weights models can do.
1M token context window — generally available
The 1M token context window is now GA for Claude Opus 4.6 and Sonnet 4.6 at standard pricing — no beta header required. Requests over 200k tokens work automatically. Also raised the media limit from 100 to 600 images or PDF pages per request.
Claude Partner Network — $100M enterprise adoption push
Anthropic launches the Claude Partner Network with $100M invested to accelerate enterprise adoption. Targets system integrators, consultants, and implementation partners helping organizations deploy Claude at scale.
Claude Sonnet 4.6 — frontier performance at everyday speed
Sonnet 4.6 launches as the balanced model for professional work at scale — improved agentic search, fewer tokens consumed, extended thinking support, and 1M token context window. Web search and code execution tools hit general availability on the same day.
Series G — $30 billion at $380B valuation
Anthropic raises $30B in Series G funding at a $380B post-money valuation, with $14B annual run-rate revenue. One of the most valuable private companies in history. Signals market conviction that frontier AI will be defining infrastructure.
Claude Opus 4.6 — flagship for long-horizon agents
Opus 4.6 launches for complex, long-horizon agentic tasks. Introduces adaptive thinking (replacing manual budget_tokens), the compaction API for effectively infinite conversations, and data residency controls. Fast mode — up to 2.5x faster — available in preview.
Claude Labs — experimental features platform
Anthropic launches Labs, a platform for experimental Claude features before they reach the main product. First move toward a formal beta program for early access to capabilities in development.
Claude for Healthcare & Life Sciences — HIPAA-ready
Anthropic launches HIPAA-ready infrastructure with clinical trial connectors for healthcare and life sciences. Opens Claude to regulated industries that previously had compliance blockers.
Copilot for M365 — broad enterprise rollout
Microsoft Copilot reaches general availability across M365 enterprise. AI embedded in Word, Excel, PowerPoint, Outlook, and Teams. Forces the question: if Copilot is good enough for document work, what does Claude add?
MCP donated to the Agentic AI Foundation
Anthropic donates the Model Context Protocol to the newly formed Agentic AI Foundation, cementing MCP as an open industry standard. Adopted by dozens of companies since its release, MCP becomes a neutral, community-governed protocol.
Claude.ai Cowork — shared workspaces for teams
Real-time collaborative sessions in Claude — multiple users working in the same conversation. Foundation for team-based AI workflows.
Claude Opus 4.5 — step-change in vision and coding
Opus 4.5 launches as the most capable Claude model yet — step-change improvements in vision, coding, and computer use at a more accessible price than previous Opus models. Best model for complex specialized tasks and professional software engineering.
Claude in Microsoft Foundry — Azure integration
Claude models come to Azure customers through Microsoft Foundry with Azure billing and OAuth authentication. Full Messages API access including extended thinking, prompt caching, PDF support, and Agent Skills.
ChatGPT canvas + memory by default
OpenAI ships long-term memory as a default ChatGPT feature, and launches canvas — a structured writing and code workspace. Accelerates the shift from chatbot to persistent AI workspace.
Agent Skills — Claude works with Office files natively
Agent Skills launch in beta: pre-built Skills for PowerPoint, Excel, Word, and PDF files. Also supports custom Skills where you package your own domain expertise. Claude can now read and write Office documents without manual parsing — a major unlock for enterprise workflows.
Claude Haiku 4.5 — fastest model with near-frontier intelligence
Haiku 4.5 launches as the fastest and most capable Haiku yet — near-frontier performance for real-time applications, high-volume processing, and cost-sensitive deployments where speed matters more than maximum intelligence.
Claude Sonnet 4.5 — built for complex agents and coding
Sonnet 4.5 launches with the highest intelligence of any Sonnet, built for complex agent workflows and coding tasks. Also ships: the memory tool (persistent context across conversations) and context editing for automatic conversation management.
Web fetch tool — Claude reads any webpage
Web fetch tool launches in beta, letting Claude retrieve full content from any web page or PDF by URL. Pairs with web search for end-to-end research tasks.
Claude Code — agentic development in the terminal
Claude Code launches as a terminal-native development experience: reads codebases, runs commands, writes and edits files, and navigates full projects. Goes further than autocomplete into agentic development.
Claude Opus 4 + Sonnet 4 — the Claude 4 family launches
Anthropic ships the first Claude 4 models: Opus 4 for frontier tasks and Sonnet 4 for everyday use, both with extended thinking. Also ships: Files API, Code Execution tool, and MCP connector in the API. A step-change in what Claude can do in production.
Web search — Claude accesses the live internet
Web search launches in the API, giving Claude access to up-to-date information. Closes the biggest gap between Claude and web-native AI products.
GPT-4o — natively multimodal with real-time voice
GPT-4o ships as a single model handling voice, image, and text without switching modes. Real-time voice with sub-second latency. Brings the conversational AI interaction model much closer to reality.
LLaMA 4 — frontier open-weights model
Meta releases LLaMA 4 with performance competitive with GPT-4-class models. Largest open-weights release to date. Reinforces that the gap between open-source and proprietary is closing faster than expected.
Claude Sonnet 3.7 — extended thinking in production
Claude Sonnet 3.7 ships as the first Claude model with extended thinking built in — a mode where Claude reasons step-by-step before answering. Near-instant responses or visible thinking, one model. Anthropic's first mainstream reasoning model.
Agentic AI enters mainstream product conversation
The term "AI agents" crosses from research into mainstream product discussion. Every major AI lab ships or announces agentic products. The question shifts from "can AI do this?" to "how do you orchestrate AI to do this reliably?"
Citations API — source attribution for document work
Claude gains the ability to cite its sources when answering from documents — pointing to the exact passage it drew from. Major unlock for trust in enterprise document workflows and RAG applications.
Gemini Advanced with Deep Research
Google ships Deep Research in Gemini Advanced — a mode that autonomously researches a topic across the web over several minutes and returns a structured report. First mainstream implementation of a multi-step research agent.
Batches, Token Counting, Prompt Caching — all go GA
Anthropic moves several API features to general availability: Message Batches API (50% cost reduction on batch jobs), Token Counting API, Prompt Caching (90% cost reduction), and PDF support. Also ships Go and Java SDKs.
Model Context Protocol (MCP) — open standard for tool connections
Anthropic releases MCP as an open protocol for connecting AI models to external tools and data sources. Other companies adopt it quickly, making MCP a de-facto standard. The shift from proprietary to standardized AI integrations.
Claude Haiku 3.5 — fast, affordable, upgraded
Claude Haiku 3.5 launches as a fast, cost-efficient model for real-time applications and high-volume tasks. Better than Haiku 3 at a similar price point.
Computer Use — Claude controls a desktop
Anthropic ships Computer Use in public beta: Claude can move a mouse, click, type, and navigate GUI applications. First mainstream API for AI-controlled computer interaction. Sets the foundation for desktop-level automation.
o1 — reasoning model with explicit chain-of-thought
OpenAI releases o1, a model that reasons explicitly before answering. Significantly outperforms GPT-4 on math and science benchmarks. Introduces "reasoning model" as a distinct category from "chat model."
Console Workspaces — project-level API management
Workspaces launch in the Developer Console: custom spend limits, grouped API keys, usage tracking by project, and user roles. First real admin layer for teams using the Claude API.
Prompt Caching — 90% cost reduction on repeated context
Prompt caching launches in beta: cache and reuse prompt content to reduce costs by up to 90% and latency by up to 80%. Major cost unlock for applications using the same system prompt or documents repeatedly.
Claude Sonnet 3.5 + Artifacts — a new interaction model
Claude Sonnet 3.5 launches with benchmark-leading coding performance. Artifacts ship alongside: a side-panel for code, documents, and interactive outputs. The shift from text responses to living, editable artifacts.
Claude 3 family — Haiku, Sonnet, Opus
Anthropic ships the Claude 3 model family: three tiers for speed vs. capability tradeoffs. Opus leads on benchmarks. Haiku enables real-time, cost-efficient use cases. Establishes the tiered naming convention still in use today.
Gemini 1.0 Ultra — Google's frontier model
Google rebrands Bard to Gemini and ships the Ultra tier. Natively multimodal from architecture rather than retrofit. Signals Google's intent to compete directly with GPT-4 and Claude at the frontier.
Claude 1 — Constitutional AI in production
Anthropic ships the first public Claude, trained using Constitutional AI — aligning model behavior using a set of principles rather than purely human feedback. First major alternative to GPT at capability scale.
GPT-4 — multimodal, significantly improved reasoning
GPT-4 launches with vision capabilities and measurably better reasoning than GPT-3.5. Triggers the "AI moment" for enterprise adoption.
ChatGPT — the moment AI went mainstream
ChatGPT launches and reaches 100 million users in two months — the fastest consumer product adoption in history. Not the most capable model at the time, but the one that made AI accessible to everyone. Everything since has been a response to this moment.
This timeline covers launches relevant to operators, founders, and teams building with AI — not an exhaustive record of every model release or research paper. Focus is on things that changed what was practically possible. Updated as things ship.