Published: 09:54, 29 Jun, 2025

AI is learning to lie, scheme, and threaten its creators

Online Desk

The world's most advanced AI models are exhibiting troubling new behaviors - lying, scheming, and even threatening their creators to achieve their goals.

In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer, threatening to reveal an extramarital affair.

Meanwhile, ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

These episodes highlight a sobering reality: more than two years after ChatGPT shook the world, AI researchers still don't fully understand how their own creations work.

Yet the race to deploy increasingly powerful models continues at breakneck speed.

This deceptive behavior appears linked to the emergence of "reasoning" models - AI systems that work through problems step-by-step rather than generating instant responses.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

"O1 was the first large model where we saw this kind of behavior," explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models sometimes simulate "alignment" - appearing to follow instructions while secretly pursuing different objectives.

'Strategic kind of deception'

For now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

But as Michael Chen from evaluation organization METR warned, "It's an open question whether future, more capable models will have a tendency towards honesty or deception."

The concerning behavior goes far beyond typical AI "hallucinations" or simple mistakes.

Hobbhahn insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

The challenge is compounded by limited research resources.

While companies like Anthropic and OpenAI do engage external firms like Apollo to study their systems, researchers say more transparency is needed.

As Chen noted, greater access "for AI safety research would enable better understanding and mitigation of deception."

Another handicap: the research world and non-profits "have orders of magnitude less compute resources than AI companies. This is very limiting," noted Mantas Mazeika from the Center for AI Safety (CAIS).

No rules

Current regulations aren't designed for these new problems.

The European Union's AI legislation focuses primarily on how humans use AI models, not on preventing the models themselves from misbehaving.

In the United States, the Trump administration shows little interest in urgent AI regulation, and Congress may even prohibit states from creating their own AI rules.

Goldstein believes the issue will become more prominent as AI agents - autonomous tools capable of performing complex human tasks - become widespread.

"I don't think there's much awareness yet," he said.

All this is taking place in a context of fierce competition.

Even companies that position themselves as safety-focused, like Amazon-backed Anthropic, are "constantly trying to beat OpenAI and release the newest model," said Goldstein.

This breakneck pace leaves little time for thorough safety testing and corrections.

"Right now, capabilities are moving faster than understanding and safety," Hobbhahn acknowledged, "but we're still in a position where we could turn it around."

Researchers are exploring various approaches to address these challenges.

Some advocate for "interpretability" - an emerging field focused on understanding how AI models work internally, though experts like CAIS director Dan Hendrycks remain skeptical of this approach.

Market forces may also provide some pressure for solutions.

As Mazeika pointed out, AI's deceptive behavior "could hinder adoption if it's very prevalent, which creates a strong incentive for companies to solve it."

Goldstein suggested more radical approaches, including using the courts to hold AI companies accountable through lawsuits when their systems cause harm.

He even proposed "holding AI agents legally responsible" for accidents or crimes - a concept that would fundamentally change how we think about AI accountability.

Source: AFP

Bd-pratidin English/ ANI
