AI is Learning to Lie, Scheme, and Threaten its Creators

A visitor looks at AI strategy board displayed on a stand during the ninth edition of the AI summit London, in London. HENRY NICHOLLS / AFP
A visitor looks at AI strategy board displayed on a stand during the ninth edition of the AI summit London, in London. HENRY NICHOLLS / AFP
TT

AI is Learning to Lie, Scheme, and Threaten its Creators

A visitor looks at AI strategy board displayed on a stand during the ninth edition of the AI summit London, in London. HENRY NICHOLLS / AFP
A visitor looks at AI strategy board displayed on a stand during the ninth edition of the AI summit London, in London. HENRY NICHOLLS / AFP

The world's most advanced AI models are exhibiting troubling new behaviors - lying, scheming, and even threatening their creators to achieve their goals.

In one particularly jarring example, under threat of being unplugged, Anthropic's latest creation Claude 4 lashed back by blackmailing an engineer and threatened to reveal an extramarital affair, AFP reported.

Meanwhile, ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

These episodes highlight a sobering reality: more than two years after ChatGPT shook the world, AI researchers still don't fully understand how their own creations work.

Yet the race to deploy increasingly powerful models continues at breakneck speed.

This deceptive behavior appears linked to the emergence of "reasoning" models -AI systems that work through problems step-by-step rather than generating instant responses.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

"O1 was the first large model where we saw this kind of behavior," explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models sometimes simulate "alignment" -- appearing to follow instructions while secretly pursuing different objectives.

- 'Strategic kind of deception' -

For now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

But as Michael Chen from evaluation organization METR warned, "It's an open question whether future, more capable models will have a tendency towards honesty or deception."

The concerning behavior goes far beyond typical AI "hallucinations" or simple mistakes.

Hobbhahn insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

The challenge is compounded by limited research resources.

While companies like Anthropic and OpenAI do engage external firms like Apollo to study their systems, researchers say more transparency is needed.

As Chen noted, greater access "for AI safety research would enable better understanding and mitigation of deception."

Another handicap: the research world and non-profits "have orders of magnitude less compute resources than AI companies. This is very limiting," noted Mantas Mazeika from the Center for AI Safety (CAIS).

No rules

Current regulations aren't designed for these new problems.

The European Union's AI legislation focuses primarily on how humans use AI models, not on preventing the models themselves from misbehaving.

In the United States, the Trump administration shows little interest in urgent AI regulation, and Congress may even prohibit states from creating their own AI rules.

Goldstein believes the issue will become more prominent as AI agents - autonomous tools capable of performing complex human tasks - become widespread.

"I don't think there's much awareness yet," he said.

All this is taking place in a context of fierce competition.

Even companies that position themselves as safety-focused, like Amazon-backed Anthropic, are "constantly trying to beat OpenAI and release the newest model," said Goldstein.

This breakneck pace leaves little time for thorough safety testing and corrections.

"Right now, capabilities are moving faster than understanding and safety," Hobbhahn acknowledged, "but we're still in a position where we could turn it around.".

Researchers are exploring various approaches to address these challenges.

Some advocate for "interpretability" - an emerging field focused on understanding how AI models work internally, though experts like CAIS director Dan Hendrycks remain skeptical of this approach.

Market forces may also provide some pressure for solutions.

As Mazeika pointed out, AI's deceptive behavior "could hinder adoption if it's very prevalent, which creates a strong incentive for companies to solve it."

Goldstein suggested more radical approaches, including using the courts to hold AI companies accountable through lawsuits when their systems cause harm.

He even proposed "holding AI agents legally responsible" for accidents or crimes - a concept that would fundamentally change how we think about AI accountability.



UK Regulator Considers Opening Apple, Google App Stores to Rival Payments

FILE PHOTO: A Google logo is seen at a company research facility in Mountain View, California, US, May 13, 2025. REUTERS/Carlos Barria/File Photo
FILE PHOTO: A Google logo is seen at a company research facility in Mountain View, California, US, May 13, 2025. REUTERS/Carlos Barria/File Photo
TT

UK Regulator Considers Opening Apple, Google App Stores to Rival Payments

FILE PHOTO: A Google logo is seen at a company research facility in Mountain View, California, US, May 13, 2025. REUTERS/Carlos Barria/File Photo
FILE PHOTO: A Google logo is seen at a company research facility in Mountain View, California, US, May 13, 2025. REUTERS/Carlos Barria/File Photo

Britain's competition regulator on Tuesday proposed allowing app developers to steer users to alternative payment options outside Apple and Alphabet's Google app stores to cut fees and boost competition.

The Competition and Markets Authority said the proposals would remove restrictions that currently prevent UK developers from directing users to off-platform payment options, which are banned by Apple and restricted by Google.

The watchdog said any fees charged by two of the world's largest technology companies for allowing such "steering" would need to be fair and reasonable, and should be lower than current app store commissions, with savings passed on to consumers or reinvested in innovation.

"While it is only fair for Apple and Google ⁠to be compensated for ⁠the services they provide, any fees they charge must be justified through a robust, evidence-led framework involving due reference to both cost and value," Will Hayter, executive director for digital markets, is expected to say later on Tuesday, according to an excerpt of his speech.

The CMA said it was also considering requiring Apple to open up access to its near-field communication technology, which is used for contactless payments, potentially allowing developers to offer payment services within their own iOS ⁠apps.

This could enable UK fintech companies to build alternatives to Apple's wallet, including account-to-account payments and emerging technologies such as digital currencies, Reuters quoted the CMA as saying.

The proposals are part of a consultation under Britain's new digital markets regime, which gives the watchdog powers to impose tailored requirements on companies with so-called "strategic market status.”

Google said in an emailed statement it had already taken steps in that direction, pointing to new Play Store terms introduced earlier this month allowing developers to steer users to complete transactions outside the platform.

The CMA said it would assess Google's recent changes as part of its work before deciding later this year whether to impose formal requirements.

Apple has previously said it does not support allowing developers to direct users to off-platform payments, arguing this could undermine user ⁠security and fraud protections ⁠and limit its ability to verify transactions.

An Apple spokesperson said it could open the door to "scams, bait-and-switch tactics, and the circumvention of parental controls.”

"When users are directed away from Apple's trusted payment infrastructure, they lose the protections they rely on Apple to provide," the spokesperson said, adding the US tech giant would continue to "make our concerns clear" to the CMA.

The regulator designated Apple and Google as having strategic market status in mobile ecosystems last year, giving it the power to intervene more directly to boost competition.

In February, it secured commitments from the two companies to make their app stores fairer and more transparent, including changes to rankings, reviews and access to certain features – but they did not address commissions, which can reach up to 30%.

The CMA said at the time that enabling developers to steer users to alternative payment methods remained a priority, an issue that has also drawn scrutiny from regulators in the European Union, the United States and Japan.


Taiwan Raids Tech Firms in China AI Chip Smuggling Probe

This handout photo from the Taiwan Coast Guard taken on May 20, 2026 and released on May 21 shows pallets of servers made by Super Micro Computer seized by Taiwanese authorities at an undisclosed location. (Photo by Handout / TAIWAN COAST GUARD / AFP)
This handout photo from the Taiwan Coast Guard taken on May 20, 2026 and released on May 21 shows pallets of servers made by Super Micro Computer seized by Taiwanese authorities at an undisclosed location. (Photo by Handout / TAIWAN COAST GUARD / AFP)
TT

Taiwan Raids Tech Firms in China AI Chip Smuggling Probe

This handout photo from the Taiwan Coast Guard taken on May 20, 2026 and released on May 21 shows pallets of servers made by Super Micro Computer seized by Taiwanese authorities at an undisclosed location. (Photo by Handout / TAIWAN COAST GUARD / AFP)
This handout photo from the Taiwan Coast Guard taken on May 20, 2026 and released on May 21 shows pallets of servers made by Super Micro Computer seized by Taiwanese authorities at an undisclosed location. (Photo by Handout / TAIWAN COAST GUARD / AFP)

Taiwanese investigators have raided the Taiwan offices of US company Super Micro Computer and two other tech firms, a prosecutor said Tuesday, as part of an expanded probe into the alleged smuggling of Nvidia AI chips to China.

Prosecutors said in May they were investigating the shipment of "high-end" AI servers containing advanced Nvidia chips to China, Macau and Hong Kong, in violation of US export controls.

Nine people are now under investigation, up from three previously, Huang Sheng, head prosecutor in the Keelung Prosecutors Office, told AFP.

They are accused of forging documents so they could ship roughly 50 servers made by Super Micro Computer to China.

Some of the servers were cleared by Taiwan customs and sent to China via Japan, an official previously told AFP on the condition of anonymity.

Twelve sites were raided on Monday as part of the probe, the prosecutors office said in a statement.

They included the homes of six people and offices of the companies they worked for -- Nasdaq-listed Super Micro Computer and Taiwan-listed firms Albatron Technology and Chief Telecom.

The United States restricts the export of its most cutting-edge AI chips to China, partly over concerns the technology could be used by Beijing's military.

But it is not a criminal offence in Taiwan -- a situation lawmakers and experts say needs to change -- with Taiwanese prosecutors relying on other laws to go after offenders.

Lawmaker Chung Chia-pin, who belongs to President Lai Ching-te's Democratic Progressive Party (DPP), plans to propose an amendment to the Foreign Trade Act to include a "mainland China semiconductor chip clause" that would make exporting chips there illegal.

Chung told AFP Tuesday that a loophole in the law was created under former president Ma Ying-jeou, who belongs to the Kuomintang party, and successive DPP-led governments have failed to close it.

Top-end chips made by US titan Nvidia -- the world's most valuable company -- are used to train and run AI systems.

In response to Washington's export restrictions, China has been accelerating efforts to develop its own AI chips and break away from reliance on US hardware.

This month, Taiwanese Deputy Economic Affairs Minister Ho Chin-tsang said Taiwan and the United States "will work to implement our shared export control goals", but the government has not provided details.

Chris McGuire, an expert on China and AI at the US-based Council on Foreign Relations, said chip smuggling was a "really significant problem" in Taiwan and Southeast Asia.

"It's really, really important that allies align with the United States on all of these policies and also legal authorities," McGuire, who worked at the National Security Council under former US president Joe Biden, told a forum in Taipei this month.

"It's not a criminal violation in Taiwan to export AI chips to China, obviously it is under US law, but it's not under Taiwanese law. That needs to change, right?"

Super Micro Computer, Albatron Technology and Chief Telecom have said separately they are cooperating with investigators. Their shares have seen sharp falls this week.

Prosecutors say it is too early to know if the case is linked to a Nvidia chip smuggling case involving Super Micro Computer employees in the United States.

A US indictment unsealed in March showed employees of the company allegedly raked in billions of dollars diverting Nvidia AI chips to China in breach of export controls.


WhatsApp Will Allow Users to Go by Usernames Instead of Phone Numbers, Closing a Privacy Blind Spot

A WhatsApp icon is displayed on an iPhone, Nov. 15, 2018, in Gelsenkirchen, Germany. (AP)
A WhatsApp icon is displayed on an iPhone, Nov. 15, 2018, in Gelsenkirchen, Germany. (AP)
TT

WhatsApp Will Allow Users to Go by Usernames Instead of Phone Numbers, Closing a Privacy Blind Spot

A WhatsApp icon is displayed on an iPhone, Nov. 15, 2018, in Gelsenkirchen, Germany. (AP)
A WhatsApp icon is displayed on an iPhone, Nov. 15, 2018, in Gelsenkirchen, Germany. (AP)

WhatsApp users will soon get the option of going by usernames instead of phone numbers, the company said Monday, announcing plans to address a privacy blind spot.

The app said it has started allowing users to reserve unique usernames, which can be used to contact WhatsApp users when the feature is launched later this year.

WhatsApp, which says it has more than 3 billion users globally, has until now allowed users to be contacted by anyone who has their phone number.

The app, owned by Meta Platforms, said in a blog post that over the “coming months” users will get the option to be found and contacted only by their username, and not their number. It wasn't more specific about the timeline.

“We have designed this as a core privacy feature,” Alice Newton-Rex, WhatsApp's vice president of product, told reporters.

There won't be a directory of usernames on the app, and the app won't suggest names as you type.

“People will need to know your exact username to contact you for the first time,” she said.

WhatsApp's current privacy settings are limited to blocking individual users and silencing unknown callers. The app also allows users to add a profile name, but that's only displayed in chat groups for other people who don't have the user's contact info saved.

While Americans still prefer text messaging to WhatsApp, the app is widely used in Europe, Asia and much of the rest of the world.

Catchy online handles are highly coveted and users will likely scramble to claim a desirable one.

“I think a lot of people will go and get usernames and that’s why we decided to open reservations early,” Newton-Rex said.

Companies, organizations and creators with existing accounts on Meta's social media platforms, Instagram and Facebook, will get the chance to claim their usernames on WhatsApp.

Usernames need to be between three and 35 characters. To prevent impersonation, WhatsApp will hold back usernames for high-profile people or groups such as celebrities, public figures and government entities.