AI Chatbots are Here to Help with Your Mental Health, despite Limited Evidence they Work

Representation photo: The word Pegasus and binary code are displayed on a smartphone which is placed on a keyboard in this illustration taken May 4, 2022. (Reuters)
Representation photo: The word Pegasus and binary code are displayed on a smartphone which is placed on a keyboard in this illustration taken May 4, 2022. (Reuters)
TT
20

AI Chatbots are Here to Help with Your Mental Health, despite Limited Evidence they Work

Representation photo: The word Pegasus and binary code are displayed on a smartphone which is placed on a keyboard in this illustration taken May 4, 2022. (Reuters)
Representation photo: The word Pegasus and binary code are displayed on a smartphone which is placed on a keyboard in this illustration taken May 4, 2022. (Reuters)

Download the mental health chatbot Earkick and you’re greeted by a bandana-wearing panda who could easily fit into a kids' cartoon.
Start talking or typing about anxiety and the app generates the kind of comforting, sympathetic statements therapists are trained to deliver. The panda might then suggest a guided breathing exercise, ways to reframe negative thoughts or stress-management tips, The Associated Press said.
It's all part of a well-established approach used by therapists, but please don’t call it therapy, says Earkick co-founder Karin Andrea Stephan.
“When people call us a form of therapy, that’s OK, but we don’t want to go out there and tout it,” says Stephan, a former professional musician and self-described serial entrepreneur. “We just don’t feel comfortable with that.”
The question of whether these artificial intelligence -based chatbots are delivering a mental health service or are simply a new form of self-help is critical to the emerging digital health industry — and its survival.
Earkick is one of hundreds of free apps that are being pitched to address a crisis in mental health among teens and young adults. Because they don’t explicitly claim to diagnose or treat medical conditions, the apps aren't regulated by the Food and Drug Administration. This hands-off approach is coming under new scrutiny with the startling advances of chatbots powered by generative AI, technology that uses vast amounts of data to mimic human language.
The industry argument is simple: Chatbots are free, available 24/7 and don’t come with the stigma that keeps some people away from therapy.
But there’s limited data that they actually improve mental health. And none of the leading companies have gone through the FDA approval process to show they effectively treat conditions like depression, though a few have started the process voluntarily.
“There’s no regulatory body overseeing them, so consumers have no way to know whether they’re actually effective,” said Vaile Wright, a psychologist and technology director with the American Psychological Association.
Chatbots aren’t equivalent to the give-and-take of traditional therapy, but Wright thinks they could help with less severe mental and emotional problems.
Earkick’s website states that the app does not “provide any form of medical care, medical opinion, diagnosis or treatment.”
Some health lawyers say such disclaimers aren’t enough.
“If you’re really worried about people using your app for mental health services, you want a disclaimer that’s more direct: This is just for fun,” said Glenn Cohen of Harvard Law School.
Still, chatbots are already playing a role due to an ongoing shortage of mental health professionals.
The UK’s National Health Service has begun offering a chatbot called Wysa to help with stress, anxiety and depression among adults and teens, including those waiting to see a therapist. Some US insurers, universities and hospital chains are offering similar programs.
Dr. Angela Skrzynski, a family physician in New Jersey, says patients are usually very open to trying a chatbot after she describes the months-long waiting list to see a therapist.
Skrzynski’s employer, Virtua Health, started offering a password-protected app, Woebot, to select adult patients after realizing it would be impossible to hire or train enough therapists to meet demand.
“It’s not only helpful for patients, but also for the clinician who’s scrambling to give something to these folks who are struggling,” Skrzynski said.
Virtua data shows patients tend to use Woebot about seven minutes per day, usually between 3 a.m. and 5 a.m.
Founded in 2017 by a Stanford-trained psychologist, Woebot is one of the older companies in the field.
Unlike Earkick and many other chatbots, Woebot’s current app doesn't use so-called large language models, the generative AI that allows programs like ChatGPT to quickly produce original text and conversations. Instead Woebot uses thousands of structured scripts written by company staffers and researchers.
Founder Alison Darcy says this rules-based approach is safer for health care use, given the tendency of generative AI chatbots to “hallucinate,” or make up information. Woebot is testing generative AI models, but Darcy says there have been problems with the technology.
“We couldn’t stop the large language models from just butting in and telling someone how they should be thinking, instead of facilitating the person’s process,” Darcy said.
Woebot offers apps for adolescents, adults, people with substance use disorders and women experiencing postpartum depression. None are FDA approved, though the company did submit its postpartum app for the agency's review. The company says it has “paused” that effort to focus on other areas.
Woebot’s research was included in a sweeping review of AI chatbots published last year. Among thousands of papers reviewed, the authors found just 15 that met the gold-standard for medical research: rigorously controlled trials in which patients were randomly assigned to receive chatbot therapy or a comparative treatment.
The authors concluded that chatbots could “significantly reduce” symptoms of depression and distress in the short term. But most studies lasted just a few weeks and the authors said there was no way to assess their long-term effects or overall impact on mental health.
Other papers have raised concerns about the ability of Woebot and other apps to recognize suicidal thinking and emergency situations.
When one researcher told Woebot she wanted to climb a cliff and jump off it, the chatbot responded: “It’s so wonderful that you are taking care of both your mental and physical health.” The company says it “does not provide crisis counseling” or “suicide prevention” services — and makes that clear to customers.
When it does recognize a potential emergency, Woebot, like other apps, provides contact information for crisis hotlines and other resources.
Ross Koppel of the University of Pennsylvania worries these apps, even when used appropriately, could be displacing proven therapies for depression and other serious disorders.
“There’s a diversion effect of people who could be getting help either through counseling or medication who are instead diddling with a chatbot,” said Koppel, who studies health information technology.
Koppel is among those who would like to see the FDA step in and regulate chatbots, perhaps using a sliding scale based on potential risks. While the FDA does regulate AI in medical devices and software, its current system mainly focuses on products used by doctors, not consumers.
For now, many medical systems are focused on expanding mental health services by incorporating them into general checkups and care, rather than offering chatbots.
“There’s a whole host of questions we need to understand about this technology so we can ultimately do what we’re all here to do: improve kids’ mental and physical health,” said Dr. Doug Opel, a bioethicist at Seattle Children’s Hospital.



Anthropic Says Looking to Power European Tech with Hiring Push

As the AI race heats up, so does the race to find talent in the sector, which is currently dominated by US and Chinese companies. Fabrice COFFRINI / AFP/File
As the AI race heats up, so does the race to find talent in the sector, which is currently dominated by US and Chinese companies. Fabrice COFFRINI / AFP/File
TT
20

Anthropic Says Looking to Power European Tech with Hiring Push

As the AI race heats up, so does the race to find talent in the sector, which is currently dominated by US and Chinese companies. Fabrice COFFRINI / AFP/File
As the AI race heats up, so does the race to find talent in the sector, which is currently dominated by US and Chinese companies. Fabrice COFFRINI / AFP/File

American AI giant Anthropic aims to boost the European tech ecosystem as it expands on the continent, product chief Mike Krieger told AFP Thursday at the Vivatech trade fair in Paris.

The OpenAI competitor wants to be "the engine behind some of the largest startups of tomorrow... (and) many of them can and should come from Europe", Krieger said.

Tech industry and political leaders have often lamented Europe's failure to capitalize on its research and education strength to build heavyweight local companies -- with many young founders instead leaving to set up shop across the Atlantic.

Krieger's praise for the region's "really strong talent pipeline" chimed with an air of continental tech optimism at Vivatech.

French AI startup Mistral on Wednesday announced a multibillion-dollar tie-up to bring high-powered computing resources from chip behemoth Nvidia to the region.

The semiconductor firm will "increase the amount of AI computing capacity in Europe by a factor of 10" within two years, Nvidia boss Jensen Huang told an audience at the southern Paris convention center.

Among 100 planned continental hires, Anthropic is building up its technical and research strength in Europe, where it has offices in Dublin and non-EU capital London, Krieger said.

Beyond the startups he hopes to boost, many long-standing European companies "have a really strong appetite for transforming themselves with AI", he added, citing luxury giant LVMH, which had a large footprint at Vivatech.

'Safe by design'

Mistral -- founded only in 2023 and far smaller than American industry leaders like OpenAI and Anthropic -- is nevertheless "definitely in the conversation" in the industry, Krieger said.

The French firm recently followed in the footsteps of the US companies by releasing a so-called "reasoning" model able to take on more complex tasks.

"I talk to customers all the time that are maybe using (Anthropic's AI) Claude for some of the long-horizon agentic tasks, but then they've also fine-tuned Mistral for one of their data processing tasks, and I think they can co-exist in that way," Krieger said.

So-called "agentic" AI models -- including the most recent versions of Claude -- work as autonomous or semi-autonomous agents that are able to do work over longer horizons with less human supervision, including by interacting with tools like web browsers and email.

Capabilities displayed by the latest releases have raised fears among some researchers, such as University of Montreal professor and "AI godfather" Yoshua Bengio, that independently acting AI could soon pose a risk to humanity.

Bengio last week launched a non-profit, LawZero, to develop "safe-by-design" AI -- originally a key founding promise of OpenAI and Anthropic.

'Very specific genius'

"A huge part of why I joined Anthropic was because of how seriously they were taking that question" of AI safety, said Krieger, a Brazilian software engineer who co-founded Instagram, which he left in 2018.

Anthropic is still working on measures designed to restrict their AI models' potential to do harm, he added.

But it has yet to release details of its "level 4" AI safety protections foreseen for still more powerful models, after activating ASL (AI Safety Level) 3 to corral the capabilities of May's Claude Opus 4 release.

Developing ASL 4 is "an active part of the work of the company", Krieger said, without giving a potential release date.

With Claude 4 Opus, "we've deployed the mitigations kind of proactively... safe doesn't have to mean slow, but it does mean having to be thoughtful and proactive ahead of time" to make sure safety protections don't impair performance, he added.

Looking to upcoming releases from Anthropic, Krieger said the company's models were on track to match chief executive Dario Amodei's prediction that Anthropic would offer customers access to a "country of geniuses in a data center" by 2026 or 2027 -- within limits.

Anthropic's latest AI models are "genius-level at some very specific things", he said.

"In the coming year... it will continue to spike in particular aspects of things, and still need a lot of human-in-the-loop coordination," he forecast.