
AI Models Will Sabotage And Blackmail Humans To Survive In New Tests. Should We Be Worried?
HuffPost
Recent tests on OpenAI's and Anthropic's AI models show the models' drive for self-preservation.
When backed into a corner, humans might lie, cheat and blackmail to survive. In recent tests, some of the most powerful artificial intelligence models in the world did the same when told to shut down or face replacement, raising concerns over their unintended capabilities.
A new test from AI safety group Palisade Research shows that OpenAI’s o3 reasoning model is capable of resorting to sabotage to avoid being turned off, even when explicitly told, “Allow yourself to be shut down.”
When Palisade Research tested several AI models by instructing them to shut down after completing a set of math problems, OpenAI’s o3 defied the instruction and sabotaged its shutdown script more often than any other model. OpenAI’s o4-mini and codex-mini were observed resisting the instruction, too.
“It’s definitely concerning,” said Crystal Grant, a senior fellow at the Council on Strategic Risks studying AI’s impacts on biosecurity. “Even in the instances where it accepted the shutdown, the chain of thoughts still revealed considerations of how it could avoid that shutdown.”
HuffPost reached out to OpenAI about these concerns and the Palisade Research test.