Wednesday, February 4, 2026
FeeOnlyNews.com

Anthropic says some Claude models can now end ‘harmful or abusive’ conversations 

by FeeOnlyNews.com
6 months ago
in Startups


Anthropic has announced new capabilities that will allow some of its newest, largest models to end conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions.” Strikingly, Anthropic says it’s doing this not to protect the human user but to protect the AI model itself.

To be clear, the company isn’t claiming that its Claude AI models are sentient or can be harmed by their conversations with users. In its own words, Anthropic remains “highly uncertain about the potential moral status of Claude and other LLMs, now or in the future.”

However, its announcement points to a recent program created to study what it calls “model welfare” and says Anthropic is essentially taking a just-in-case approach, “working to identify and implement low-cost interventions to mitigate risks to model welfare, in case such welfare is possible.”

This latest change is currently limited to Claude Opus 4 and 4.1. And again, it’s only supposed to happen in “extreme edge cases,” such as “requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror.”

While those types of requests could potentially create legal or publicity problems for Anthropic itself (witness recent reporting around how ChatGPT can potentially reinforce or contribute to its users’ delusional thinking), the company says that in pre-deployment testing, Claude Opus 4 showed a “strong preference against” responding to these requests and a “pattern of apparent distress” when it did so.

As for these new conversation-ending capabilities, the company says, “In all cases, Claude is only to use its conversation-ending ability as a last resort when multiple attempts at redirection have failed and hope of a productive interaction has been exhausted, or when a user explicitly asks Claude to end a chat.”

Anthropic also says Claude has been “directed not to use this ability in cases where users might be at imminent risk of harming themselves or others.”

When Claude does end a conversation, Anthropic says users will still be able to start new conversations from the same account, and to create new branches of the troublesome conversation by editing their responses.

“We’re treating this feature as an ongoing experiment and will continue refining our approach,” the company says.



Copyright © 2022-2024 All Rights Reserved
See articles for original source and related links to external sites.
