Anthropic, OpenAI Dial Back Safety Language as AI Race Accelerates

In brief TIME reports Anthropic dropped a pledge to halt training without guaranteed safeguards. OpenAI also removed โ€œsafelyโ€ from its mission after restructuring into a for-profit entity. Experts say the shift reflects political, economic, and intellectual changes. Anthropic has dropped a central safety pledge from its Responsible Scaling Policy, according to a report by TIME.…


Anthropic, OpenAI Dial Back Safety Language as AI Race Accelerates
Anthropic, OpenAI Dial Back Safety Language as AI Race Accelerates

In brief

  • TIME reports Anthropic dropped a pledge to halt training without guaranteed safeguards.
  • OpenAI also removed โ€œsafelyโ€ from its mission after restructuring into a for-profit entity.
  • Experts say the shift reflects political, economic, and intellectual changes.

Anthropic has dropped a central safety pledge from its Responsible Scaling Policy, according to a report by TIME. The changes loosen a commitment that once barred the Claude AI developer from training advanced AI systems without guaranteed safeguards in place.

The move reshapes how the company positions itself in the AI race against rivals OpenAI, Google, and xAI. Anthropic has long cast itself as one of the industryโ€™s most safety-focused labs, but under the revised policy, Anthropic no longer promises to halt training if risk mitigations are not fully in place.

โ€œWe felt that it wouldn’t actually help anyone for us to stop training AI models,โ€ Anthropicโ€™s chief science officer, Jared Kaplan, told TIME. โ€œWe didn’t really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments โ€ฆ if competitors are blazing ahead.โ€

The change comes as Anthropic finds itself embroiled in a public dispute withย U.S. Defense Secretaryย Pete Hegseth over refusing to grant the Pentagonย full access to Claude, making it the only major AI lab among Google, xAI, Meta, and OpenAI to take that stance.

Edward Geist, a senior policy researcher at the RAND Corporation, said the earlier โ€œAI safetyโ€ framing emerged from a specific intellectual community that predated todayโ€™s large language models.

โ€œAs of a few years ago, there was the field of AI safety,โ€ Geist told Decrypt. โ€œAI safety was associated with a particular set of views that came out of the community of people who cared about powerful AI before we had these LLMs.โ€

Geist said early AI safety advocates were working from a very different vision of what advanced artificial intelligence would look like.

โ€œThey ended up conceptualizing the problem in a way that, in some respects, was envisioning something qualitatively different from these current LLMs, for better or worse,โ€ Geist said.

Geist said the language change also sends a signal to investors and policymakers.

โ€œPart of it is signaling to various constituencies that a lot of these companies want to give the impression that they are not holding back in the economic competition because of concerns about โ€˜AI safety,โ€™โ€ he said, adding that the terminology itself is changing to fit the times.

Anthropic is not alone in revising its safety language.

What defines AI safety?

A recent report by the non-profit news organization, The Conversation, noted how OpenAI also changed its mission statement in its 2024 IRS filing, removing the word โ€œsafely.โ€

The companyโ€™s earlier statement pledged to build general-purpose AI that โ€œsafely benefits humanity, unconstrained by a need to generate financial return.โ€ The updated version now states its goal is โ€œto ensure that artificial general intelligence benefits all of humanity.โ€

โ€œThe problem with the term AI security is that no one seems to know what that means exactly,โ€ Geist said. โ€œThen again, the AI safety term was also contested.โ€

Anthropicโ€™s new policy emphasizes transparency measures such as publishing โ€œfrontier safety roadmapsโ€ and regular โ€œrisk reports,โ€ and says it will delay development if it believes there is a significant risk of catastrophe.

Anthropic and OpenAIโ€™s policy shifts come as the companies look to strengthen their commercial position.

Earlier this month, Anthropic said it raised $30 billion at a valuation of about $380 billion. At the same time, OpenAI is finalizing a funding round backed by Amazon, Microsoft, and Nvidia that could reach $100 billion.

Anthropic and OpenAI, along with Google and xAI, have been awarded lucrative government contracts with the U.S. Department of Defense. For Anthropic, however, the contract appears in doubt as the Pentagon weighs whether to cut ties to the AI firm over access complaints.

As capital pours into the sector and geopolitical competition intensifies, Hamza Chaudhry, AI and National Security Lead at the Future of Life Institute, said the policy change reflects shifting political dynamics rather than a bid for Pentagon business.

โ€œIf that were the case, they would have just backed down from what the Pentagon said a week ago,โ€ Chaudhry told Decrypt. โ€œDario [Amodei] wouldn’t have shown up to meet.โ€

Instead, Chaudhry said the rewrite reflects a turning point in how AI companies talk about risk as political pressure and competitive stakes rise.

โ€œAnthropic is now saying, โ€˜Look, we can’t keep saying safety, we can’t unconditionally pause, and we’re going to push for much lighter-touch regulation,โ€™โ€ he said.

Daily Debrief Newsletter

Start every day with the top news stories right now, plus original features, a podcast, videos and more.

Source link