Anthropic, the wildly successful AI company that has cast itself as the most safety-conscious of the top research labs, is dropping the central pledge of its flagship safety policy, company officials tell TIME.

In 2023, Anthropic committed to never train an AI system unless it could guarantee in advance that the company’s safety measures were adequate. For years, its leaders touted that promise, the central pillar of its Responsible Scaling Policy (RSP), as evidence that the company was responsible and would withstand market incentives to rush the development of a potentially dangerous technology.

But in recent months the company decided to radically overhaul the RSP. That overhaul includes scrapping the promise not to release AI models if Anthropic can’t guarantee proper risk mitigations in advance.

  • XLE@piefed.social

    Funny timeline

    • February 24: Anthropic drops “responsible” policy
    • February 25: Defense Department gives Anthropic a deadline
    • February 27: Trump orders cutting ties