AI Codex
Foundation Models & LLMsDevelopersCTOs

Transformer Architecture

Also: Transformer

The engineering design that underlies virtually all modern AI language models — including Claude, GPT-4, and Gemini. Introduced by Google researchers in 2017, it replaced older sequential designs and made it possible to process language in parallel at massive scale. You never interact with it directly — it's the technical architecture that makes large-scale AI possible, the same way you don't think about car engine design when you're driving.

In practice

Every time Claude reads your message and writes a coherent response — understanding context, maintaining tone, following instructions — the transformer architecture is what makes that possible. You never interact with it directly. It's the engineering design introduced in 2017 that made large-scale AI practical. All modern AI assistants are built on variations of it.

Related concepts