AI Codex
Infrastructure & DeploymentDevelopersCTOs

Throughput

The volume of requests a system can handle per unit time — the capacity dimension of AI infrastructure, as opposed to latency's speed dimension.