- Anthropic has removed its pledge not to train or release AI models without guaranteed safety mitigations in advance
- The company will now rely on transparency reports and safety roadmaps instead of strict preconditions
- Critics argue the shift shows the limits of voluntary AI safety commitments without binding regulation
Anthropic has formally abandoned the central promise not to train or release frontier AI systems unless it can guarantee adequate safety in advance. The company behind Claude confirmed the decision in an interview with Time, marking the end of a policy that had once set it apart among AI developers. The newly revised Responsible Scaling Policy focuses more on ensuring the company stays competitive as the AI marketplace heats up.
For years, Anthropic framed that pledge as evidence that it would resist the commercial pressures pushing competitors to ship ever more powerful systems. The policy effectively barred it from advancing beyond certain levels unless predefined safety measures were already in place. Now, Anthropic is using a more flexible framework rather than categorical pauses.
The company insists the change is pragmatic rather than ideological. Executives argue that unilateral restraint no longer makes sense in a market defined by rapid iteration and geopolitical urgency. But the shift feels like a turning point in how the AI industry thinks about self-regulation.
Under the new Responsible Scaling Policy, Anthropic pledges to publish detailed “Frontier Safety Roadmaps” outlining its planned safety milestones, along with regular “Risk Reports” that assess model capabilities and potential threats. The company also says it will match or exceed competitors’ safety efforts and delay development if it both believes it leads the field and identifies significant catastrophic risk. What it will no longer do is promise to halt training until all mitigations are guaranteed in advance.
Everyday users might not notice any changes as they interact with Claude or other AI tools. Yet the guardrails that govern how those systems are trained influence everything from accuracy to susceptibility to misuse. When a company once defined by its strict preconditions decides those conditions are no longer workable, it signals a broader recalibration within the industry.
Claude control
When Anthropic introduced its original policy in 2023, some executives hoped it might inspire rivals or even inform eventual regulation. That regulatory momentum never fully materialized. Federal AI legislation remains stalled, and the broader political climate has tilted away from developing any framework. Companies are left to choose between voluntary restraint and competitive survival.
Anthropic is growing rapidly, with revenue and a product portfolio that compete with rivals like OpenAI and Google; it even poked fun at ChatGPT getting ads in a Super Bowl advertisement. But the company clearly saw the safety red line as an impediment to that growth.
Anthropic maintains that its revised framework preserves meaningful safeguards. The new Roadmaps are intended to create internal pressure to prioritize mitigation research. The forthcoming Risk Reports aim to provide a clearer public accounting of how model capabilities might lead to misuse.
“The new policy still includes some guardrails, but the core promise, that Anthropic would not release models unless it could guarantee adequate safety mitigations in advance, is gone,” said Nik Kairinos, CEO and co-founder of RAIDS AI, an organization focused on independent monitoring and risk detection in AI. “This is precisely why continuous, independent monitoring of AI systems matters. Voluntary commitments can be rewritten. Regulation, backed by real-time oversight, cannot.”
Kairinos also noted the irony in Anthropic’s $20 million donation a couple of weeks ago to Public First Action, a group supporting congressional candidates who pledge to push for AI safety regulation. That contribution, he suggested, underscores the complexity of the current moment. Companies may advocate for stronger regulation while simultaneously recalibrating their own internal constraints.
The broader question facing the industry is whether voluntary norms can meaningfully shape the trajectory of transformative technologies. Anthropic once positioned itself as a model of restraint. Its revised policy instead requires it to keep pace with the competition. That does not mean safety has been abandoned, but it does mean the order of operations has shifted.
The average person may not read Responsible Scaling Policies or Risk Reports, but they live with the downstream effects of those decisions. Anthropic argues that meaningful safety research requires staying at the frontier, not stepping back from it. Whether that philosophy proves reassuring or unsettling depends largely on one’s view of how fast AI should move and how much risk society is willing to tolerate in exchange for progress.