What is Jailbreaking?

Question

Accepted Answer

Techniques people use to get AI models to bypass their safety guidelines — often by framing requests as hypothetical, fictional, or through elaborate roleplays. Jailbreaking is an ongoing challenge: as AI companies tighten safeguards, people find new ways around them. Claude is designed to maintain its values across different framings. Understanding jailbreaking matters for operators because some users will try it on your deployed AI product.