AI Codex
Tools & Ecosystem ClaudeExecutivesCTOs

AI Safety Level

Also: ASL

Anthropic's internal framework for classifying how capable and potentially risky an AI model is — and what safety commitments apply at each level. ASL-1: minimal risk. ASL-2: current Claude models (meaningful capabilities, manageable risks). ASL-3+: future models with capabilities that would require significantly stronger safety measures before deployment. The ASL framework is Anthropic's public commitment to pause or restrict deployment if a model reaches certain capability thresholds without corresponding safety measures.

In practice

Anthropic tracks how capable Claude is getting across dangerous domains — like helping with biological weapons or sophisticated cyberattacks. AI Safety Levels are internal thresholds that trigger stricter safeguards as capabilities cross certain lines. ASL-2 is current Claude. ASL-3 would require new protective measures before deployment.

Related concepts