What is AI Safety?

Question

What is AI Safety?

Accepted Answer

The field and practice of ensuring AI systems do what they're supposed to do, without causing unintended harm — at every scale from today's applications to future, more powerful systems. At the deployment level: guardrails, safety testing, content filtering. At the research level: alignment, interpretability, and understanding how AI models make decisions. Anthropic was founded specifically to work on AI safety and publishes extensively on the topic.