No Result
View All Result
  • Login
Thursday, June 18, 2026
FeeOnlyNews.com
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading
No Result
View All Result
FeeOnlyNews.com
No Result
View All Result
Home Markets

Chart of the Week: AI Is a Black Box

by FeeOnlyNews.com
11 hours ago
in Markets
Reading Time: 4 mins read
A A
0
Chart of the Week: AI Is a Black Box
Share on FacebookShare on TwitterShare on LInkedIn


A strange thing happened last week.

Anthropic was forced to take its newest AI models offline only days after releasing them.

The company’s new Fable 5 and Mythos 5 systems were designed to be some of the most powerful AI models ever released. But shortly after launch, researchers discovered ways to get around some of the models’ built-in safety measures.

Government officials soon got involved as fears spread that these systems could become powerful cybersecurity weapons in the wrong hands.

Maybe those concerns were justified, and maybe they weren’t.

But to me, they raise an obvious question that not enough people are asking.

How would anyone know?

What’s Inside the Box?

Modern AI systems aren’t like traditional software.

Engineers don’t sit down and write lines of code telling them exactly how to reason through a problem.

Instead, researchers train these systems and then observe their behavior.

The result is what many researchers call a black box.

We can see what goes in, and we can see what comes out.

But what happens in between is often much harder to explain.

That’s why companies like Anthropic spend so much time studying AI interpretability, or the science of understanding how these systems arrive at their conclusions.

And that brings us to this week’s chart.

Because a group of researchers recently performed a strange experiment.

They secretly modified an AI model’s internal state. Then they asked whether the model could detect that something had changed.

Image: Uzay Macar and Li Yang

This chart might look complicated, but the basic idea is simple.

Researchers injected information directly into an AI model’s internal processing, then tested whether it could tell the difference between those injections and its normal thought process.

The chart compares three versions of the same model.

The first is the Base model, the raw AI system before it receives additional training.

The second is the Instruct model, which was trained to behave more like the helpful AI assistants most people interact with today.

The third is an Abliterated version of the model, where some of the refusal and safety behaviors were removed.

The blue line shows how often the model correctly detected a real change, while the orange line shows how often it falsely claimed that something changed when nothing had actually happened.

And the results are surprising.

The Base model performed poorly. When researchers secretly altered its internal processing, it often couldn’t tell the difference between a real change and a false alarm.

But the Instruct model performed much better.

Somewhere during the additional training process, the model appears to have developed an ability to recognize when something unusual had happened inside its own processing.

And in several cases, the Abliterated model performed even better still.

In other words, removing some of the AI’s safety and refusal behaviors actually improved the model’s ability to detect what was going on inside it.

That doesn’t mean the model became conscious or self-aware.

You can compare it to a computer server that detects when someone has tampered with its memory. The server isn’t aware of anything, but it can still recognize when something unusual has happened.

Researchers believe something similar happened here.

More importantly, they think capabilities like this could eventually help us better understand what’s happening inside advanced AI systems.

After all, these models have access to information that remains largely hidden from the people studying them.

Which means one way researchers could eventually learn more about advanced AI systems is by asking the systems themselves.

That might seem counterintuitive.

But it would give researchers something they’ve never really had before.

A window into what’s happening inside the model itself.

Here’s My Take

The primary goal of the AI industry has been to build more capable models.

But another challenge is gaining urgency.

Understanding them.

The controversy surrounding Anthropic’s latest models shows why we need to get a handle on this issue sooner than later.

Because it’s one thing to build a powerful AI system. It’s something else entirely to create a new form of intelligence yet only partially understand how it works.

So here’s my question to you:

If future AI systems become too complex for humans to fully understand on their own, would you trust AI to help explain what’s happening inside other AI models?

Or does that sound like asking the fox to guard the henhouse?

I’d love to hear what you think.

Let me know at [email protected].

We won’t reveal your full name in the event we publish a response, so feel free to share your honest opinion.

Regards,

Ian King's SignatureIan KingChief Strategist, Banyan Hill Publishing



Source link

Tags: BlackBoxchartweek
ShareTweetShare
Previous Post

Illinois’ new crypto tax puts users under a burden stocks do not face

Next Post

From Bilderberg to Dialog: How Peter Thiel’s ‘Secret Society’ Signals a New Elite

Related Posts

Wabtec (WAB) Has an Aftermarket and Rail-Modernization Platform Story Bigger Than a Freight Cycle Trade

Wabtec (WAB) Has an Aftermarket and Rail-Modernization Platform Story Bigger Than a Freight Cycle Trade

by FeeOnlyNews.com
June 18, 2026
0

Wabtec (WAB) is often grouped with rail-cycle names and treated as a way to trade freight volumes or new locomotive...

The average SpaceX buyer post-IPO is almost under water after two-day slide

The average SpaceX buyer post-IPO is almost under water after two-day slide

by FeeOnlyNews.com
June 18, 2026
0

SpaceX celebrates their IPO at the Nasdaq on June 12th, 2026.Adam Jeffery | CNBCThe average investor who bought SpaceX shares...

The DTI Trap: Why Traditional Financing Stops Working After Your Second Rental (And What to Do Instead)

The DTI Trap: Why Traditional Financing Stops Working After Your Second Rental (And What to Do Instead)

by FeeOnlyNews.com
June 18, 2026
0

In This Article This article is presented by LendingOne. You have two rentals. Both are cash-flowing and performing exactly the...

Allegiant Air Cut 61 Routes, Including Three in Las Vegas

Allegiant Air Cut 61 Routes, Including Three in Las Vegas

by FeeOnlyNews.com
June 18, 2026
0

Allegiant Air, which has origins in Las Vegas, has dropped three routes to the Southern Nevada city as part of...

Can You Still Succeed With Weekend Trades?

Can You Still Succeed With Weekend Trades?

by FeeOnlyNews.com
June 18, 2026
0

Do you think that you have to be in front of your computer nonstop to succeed as a trader? What...

Wall Street is Locking You Out of the Housing Market

Wall Street is Locking You Out of the Housing Market

by FeeOnlyNews.com
June 18, 2026
0

Dave:Expenses are skyrocketing throughout our industry from construction costs to insurance rates to repairs and pretty much everything else, prices...

Next Post
From Bilderberg to Dialog: How Peter Thiel’s ‘Secret Society’ Signals a New Elite

From Bilderberg to Dialog: How Peter Thiel's 'Secret Society' Signals a New Elite

9 Stocks Offering Up to 46% Upside Despite a Hawkish Fed

9 Stocks Offering Up to 46% Upside Despite a Hawkish Fed

  • Trending
  • Comments
  • Latest
10 States Offering Free or Low‑Cost College Courses for Residents Over 60

10 States Offering Free or Low‑Cost College Courses for Residents Over 60

May 13, 2026
Trump reportedly pressed FDA chief to authorize mango and blueberry vapes after years of rejection

Trump reportedly pressed FDA chief to authorize mango and blueberry vapes after years of rejection

May 7, 2026
Synopsys targets .61B revenue for 2026 while advancing joint AI solutions and accelerating Ansys integration (NASDAQ:SNPS)

Synopsys targets $9.61B revenue for 2026 while advancing joint AI solutions and accelerating Ansys integration (NASDAQ:SNPS)

December 10, 2025
Strait Outta Hormuz: Getting the Iran Oil Story Straight

Strait Outta Hormuz: Getting the Iran Oil Story Straight

June 12, 2026
Rothbard on Scientism | Mises Institute

Rothbard on Scientism | Mises Institute

June 5, 2026
Memorial Day 2026: Take Advantage of Food Freebies, Deals

Memorial Day 2026: Take Advantage of Food Freebies, Deals

May 23, 2026
Chart of the Week: AI Is a Black Box

Chart of the Week: AI Is a Black Box

0
CFTC Settlement Bans Celsius Founder Mashinsky From Trading

CFTC Settlement Bans Celsius Founder Mashinsky From Trading

0
Inside Trump’s Anthropic crackdown | Fortune

Inside Trump’s Anthropic crackdown | Fortune

0
5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

0
Supreme Court Backs Gun Rights for ‘Casual’ Drug Users

Supreme Court Backs Gun Rights for ‘Casual’ Drug Users

0
China’s Industrial Policy: Ambition, Inefficiency, and a Cautionary Tale for America

China’s Industrial Policy: Ambition, Inefficiency, and a Cautionary Tale for America

0
CFTC Settlement Bans Celsius Founder Mashinsky From Trading

CFTC Settlement Bans Celsius Founder Mashinsky From Trading

June 18, 2026
Inside Trump’s Anthropic crackdown | Fortune

Inside Trump’s Anthropic crackdown | Fortune

June 18, 2026
How Jim Rowe Filled a Shopping Desert—With Costco Returns

How Jim Rowe Filled a Shopping Desert—With Costco Returns

June 18, 2026
5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

5 Pennsylvania Rebate Rules Seniors Should Check Before the Property Tax/Rent Deadline

June 18, 2026
Litecoin Spot ETF Flows Show Slow Altcoin Demand

Litecoin Spot ETF Flows Show Slow Altcoin Demand

June 18, 2026
SoFi High Yield Savings: Current Rates, Boosts & Promotions

SoFi High Yield Savings: Current Rates, Boosts & Promotions

June 18, 2026
FeeOnlyNews.com

Get the latest news and follow the coverage of Business & Financial News, Stock Market Updates, Analysis, and more from the trusted sources.

CATEGORIES

  • Business
  • Cryptocurrency
  • Economy
  • Financial Planning
  • Investing
  • Market Analysis
  • Markets
  • Money
  • Personal Finance
  • Startups
  • Stock Market
  • Trading

LATEST UPDATES

  • CFTC Settlement Bans Celsius Founder Mashinsky From Trading
  • Inside Trump’s Anthropic crackdown | Fortune
  • How Jim Rowe Filled a Shopping Desert—With Costco Returns
  • Our Great Privacy Policy
  • Terms of Use, Legal Notices & Disclaimers
  • About Us
  • Contact Us

Copyright © 2022-2024 All Rights Reserved
See articles for original source and related links to external sites.

Welcome Back!

Sign In with Facebook
Sign In with Google
Sign In with Linked In
OR

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Business
  • Financial Planning
  • Personal Finance
  • Investing
  • Money
  • Economy
  • Markets
  • Stocks
  • Trading

Copyright © 2022-2024 All Rights Reserved
See articles for original source and related links to external sites.