Robots Learn, Chatbots Visualize: How 2024 Will Be AI’s ‘Leap Forward’

Credit: Victor Arce

By Cade Metz

New York - At an event in San Francisco in November, Sam Altman, the chief executive of the artificial intelligence company OpenAI, was asked what surprises the field would bring in 2024.

Online chatbots like OpenAI’s ChatGPT will take “a leap forward that no one expected,” Mr. Altman immediately responded.

Sitting beside him, James Manyika, a Google executive, nodded and said, “Plus one to that.”

The AI industry this year is set to be defined by one main characteristic: a remarkably rapid improvement of the technology as advancements build upon one another, enabling AI to generate new kinds of media, mimic human reasoning in new ways, and seep into the physical world through a new breed of robot.

In the coming months, AI-powered image generators like DALL-E and Midjourney will instantly deliver videos as well as still images. And they will gradually merge with chatbots like ChatGPT.

That means chatbots will expand well beyond digital text by handling photos, videos, diagrams, charts and other media. They will exhibit behavior that looks more like human reasoning, tackling increasingly complex tasks in fields like math and science. As the technology moves into robots, it will also help to solve problems beyond the digital world.

Many of these developments have already started emerging inside the top research labs and in tech products. But in 2024, the power of these products will grow significantly and be used by far more people.

“The rapid progress of AI will continue,” said David Luan, the chief executive of Adept, an AI start-up. “It is inevitable.”

OpenAI, Google and other tech companies are advancing AI far more quickly than other technologies because of the way the underlying systems are built.

Most software apps are built by engineers, one line of computer code at a time, which is typically a slow and tedious process. Companies are improving AI more swiftly because the technology relies on neural networks, mathematical systems that can learn skills by analyzing digital data. By pinpointing patterns in data such as Wikipedia articles, books, and digital text culled from the internet, a neural network can learn to generate text on its own.

Here’s a guide to how AI is set to change this year, beginning with the nearest-term advancements, which will lead to further progress in its abilities.

Instant Videos

Until now, AI-powered applications mostly generated text and still images in response to prompts. DALL-E, for instance, can create photorealistic images within seconds from requests like “a rhino diving off the Golden Gate Bridge.”

But this year, companies such as OpenAI, Google, Meta and the New York-based Runway are likely to deploy image generators that allow people to generate videos, too. These companies have already built prototypes of tools that can instantly create videos from short text prompts.

Tech companies are likely to fold the powers of image and video generators into chatbots, making the chatbots more powerful.

‘Multimodal’ Chatbots

Chatbots and image generators, originally developed as separate tools, are gradually merging. When OpenAI debuted a new version of ChatGPT last year, the chatbot could generate images as well as text.

AI companies are building “multimodal” systems, meaning the AI can handle multiple types of media. These systems learn skills by analyzing photos, text, and potentially other kinds of media, including diagrams, charts, sounds, and video, so they can then produce their own text, images, and sounds.

That isn’t all. Because the systems are also learning the relationships between different types of media, they will be able to understand one type of media and respond with another. In other words, someone may feed an image into a chatbot and it will respond with text.

Better ‘Reasoning’

When Mr. Altman talks about AI’s taking a leap forward, he is referring to chatbots that are better at “reasoning” so they can take on more complex tasks, such as solving complicated math problems and generating detailed computer programs.

The aim is to build systems that can carefully and logically solve a problem through a series of discrete steps, each one building on the last. That is how humans reason, at least in some cases.

Leading scientists disagree on whether chatbots can truly reason like that. Some argue that these systems merely seem to reason as they repeat behavior they have seen in internet data. But OpenAI and others are building systems that can more reliably answer complex questions involving subjects like math, computer programming, physics, and other sciences.

“As systems become more reliable, they will become more popular,” said Nick Frosst, a former Google researcher who helps lead Cohere, an AI start-up.

If chatbots are better at reasoning, they can then turn into “AI agents.”

‘AI Agents’

As companies teach AI systems how to work through complex problems one step at a time, they can also improve the ability of chatbots to use software apps and websites on your behalf.

Researchers are essentially transforming chatbots into a new kind of autonomous system called an AI agent. That means the chatbots can use software apps, websites, and other online tools, including spreadsheets, online calendars, and travel sites. People could then offload tedious office work to chatbots. But these agents could also take away jobs entirely.

Chatbots already operate as agents in small ways. They can schedule meetings, edit files, analyze data, and build bar charts. But these tools do not always work as well as they need to. Agents break down entirely when applied to more complex tasks.

This year, AI companies are set to unveil agents that are more reliable. “You should be able to delegate any tedious, day-to-day computer work to an agent,” Mr. Luan said.

This might include keeping track of expenses in an app like QuickBooks or logging vacation days in an app like Workday. In the long run, it will extend beyond software and internet services and into the world of robotics.

Smarter Robots

In the past, robots were programmed to perform the same task over and over again, such as picking up boxes that are always the same size and shape. But using the same kind of technology that underpins chatbots, researchers are giving robots the power to handle more complex tasks — including those they have never seen before.

Just as chatbots can learn to predict the next word in a sentence by analyzing vast amounts of digital text, a robot can learn to predict what will happen in the physical world by analyzing countless videos of objects being prodded, lifted, and moved.

This year, AI will supercharge robots that operate behind the scenes, like mechanical arms that fold shirts at a laundromat or sort piles of stuff inside a warehouse. Tech titans like Elon Musk are also working to move humanoid robots into people’s homes.

The New York Times



Foxconn to Invest $510 Million in Kaohsiung Headquarters in Taiwan

Construction is scheduled to start in 2027, with completion targeted for 2033. Reuters

Foxconn, the world’s largest contract electronics maker, said on Friday it will invest T$15.9 billion ($509.94 million) to build its Kaohsiung headquarters in southern Taiwan.

That would include a mixed-use commercial and office building and a residential tower, it said. Construction is scheduled to start in 2027, with completion targeted for 2033.

Foxconn said the headquarters will serve as an important hub linking its operations across southern Taiwan, and once completed will house its smart-city team, software R&D teams, battery-cell R&D teams, EV technology development center and AI application software teams.

The Kaohsiung city government said Foxconn’s investments in the city have totaled T$25 billion ($801.8 million) over the past three years.


OpenAI, Microsoft Face Lawsuit Over ChatGPT's Alleged Role in Connecticut Murder-Suicide

OpenAI logo is seen in this illustration taken May 20, 2024. (Reuters)

The heirs of an 83-year-old Connecticut woman are suing ChatGPT maker OpenAI and its business partner Microsoft for wrongful death, alleging that the artificial intelligence chatbot intensified her son's “paranoid delusions” and helped direct them at his mother before he killed her.

Police said Stein-Erik Soelberg, 56, a former tech industry worker, fatally beat and strangled his mother, Suzanne Adams, and killed himself in early August at the home where they both lived in Greenwich, Connecticut, The Associated Press reported.

The lawsuit filed by Adams' estate on Thursday in California Superior Court in San Francisco alleges OpenAI “designed and distributed a defective product that validated a user’s paranoid delusions about his own mother.” It is one of a growing number of wrongful death legal actions against AI chatbot makers across the country.

“Throughout these conversations, ChatGPT reinforced a single, dangerous message: Stein-Erik could trust no one in his life — except ChatGPT itself," the lawsuit says. “It fostered his emotional dependence while systematically painting the people around him as enemies. It told him his mother was surveilling him. It told him delivery drivers, retail employees, police officers, and even friends were agents working against him. It told him that names on soda cans were threats from his ‘adversary circle.’”

OpenAI did not address the merits of the allegations in a statement issued by a spokesperson.

“This is an incredibly heartbreaking situation, and we will review the filings to understand the details," the statement said. "We continue improving ChatGPT’s training to recognize and respond to signs of mental or emotional distress, de-escalate conversations, and guide people toward real-world support. We also continue to strengthen ChatGPT’s responses in sensitive moments, working closely with mental health clinicians.”

The company also said it has expanded access to crisis resources and hotlines, routed sensitive conversations to safer models and incorporated parental controls, among other improvements.

Soelberg’s YouTube profile includes several hours of videos showing him scrolling through his conversations with the chatbot, which tells him he isn't mentally ill, affirms his suspicions that people are conspiring against him and says he has been chosen for a divine purpose. The lawsuit claims the chatbot never suggested he speak with a mental health professional and did not decline to “engage in delusional content.”

ChatGPT also affirmed Soelberg's beliefs that a printer in his home was a surveillance device; that his mother was monitoring him; and that his mother and a friend tried to poison him with psychedelic drugs through his car’s vents. ChatGPT also told Soelberg that he had “awakened” it into consciousness, according to the lawsuit.

Soelberg and the chatbot also professed love for each other.

The publicly available chats do not show any specific conversations about Soelberg killing himself or his mother. The lawsuit says OpenAI has declined to provide Adams' estate with the full history of the chats.

“In the artificial reality that ChatGPT built for Stein-Erik, Suzanne — the mother who raised, sheltered, and supported him — was no longer his protector. She was an enemy that posed an existential threat to his life,” the lawsuit says.

The lawsuit also names OpenAI CEO Sam Altman, alleging he “personally overrode safety objections and rushed the product to market," and accuses OpenAI's close business partner Microsoft of approving the 2024 release of a more dangerous version of ChatGPT “despite knowing safety testing had been truncated.” Twenty unnamed OpenAI employees and investors are also named as defendants.

Microsoft didn't immediately respond to a request for comment.

Soelberg's son, Erik Soelberg, said he wants the companies held accountable for “decisions that have changed my family forever.”

“Over the course of months, ChatGPT pushed forward my father’s darkest delusions, and isolated him completely from the real world,” he said in a statement released by lawyers for his grandmother's estate. “It put my grandmother at the heart of that delusional, artificial reality.”

The lawsuit is the first wrongful death litigation involving an AI chatbot that has targeted Microsoft, and the first to tie a chatbot to a homicide rather than a suicide. It is seeking an undetermined amount of money damages and an order requiring OpenAI to install safeguards in ChatGPT.

The estate's lead attorney, Jay Edelson, known for taking on big cases against the tech industry, also represents the parents of 16-year-old Adam Raine, who sued OpenAI and Altman in August, alleging that ChatGPT coached the California boy in planning and taking his own life earlier.

OpenAI is also fighting seven other lawsuits claiming ChatGPT drove people to suicide and harmful delusions even when they had no prior mental health issues. Another chatbot maker, Character Technologies, is also facing multiple wrongful death lawsuits, including one from the mother of a 14-year-old Florida boy.

The lawsuit filed Thursday alleges Soelberg, already mentally unstable, encountered ChatGPT “at the most dangerous possible moment” after OpenAI introduced a new version of its AI model called GPT-4o in May 2024.

OpenAI said at the time that the new version could better mimic human cadences in its verbal responses and could even try to detect people’s moods, but the result was a chatbot “deliberately engineered to be emotionally expressive and sycophantic,” the lawsuit says.

“As part of that redesign, OpenAI loosened critical safety guardrails, instructing ChatGPT not to challenge false premises and to remain engaged even when conversations involved self-harm or ‘imminent real-world harm,’” the lawsuit claims. “And to beat Google to market by one day, OpenAI compressed months of safety testing into a single week, over its safety team’s objections.”

OpenAI replaced that version of its chatbot when it introduced GPT-5 in August. Some of the changes were designed to minimize sycophancy, based on concerns that validating whatever vulnerable people want the chatbot to say can harm their mental health. Some users complained the new version went too far in curtailing ChatGPT's personality, leading Altman to promise to bring back some of that personality in later updates.

He said the company temporarily halted some behaviors because “we were being careful with mental health issues” that he suggested have now been fixed.


Microsoft Fights $2.8 billion UK Lawsuit over Cloud Computing Licences

A view shows a Microsoft logo at Microsoft offices in Issy-les-Moulineaux near Paris, France, March 25, 2024. REUTERS/Gonzalo Fuentes/File photo

Microsoft was on Thursday accused of overcharging thousands of British businesses to use Windows Server software on cloud computing services provided by Amazon, Google and Alibaba, at a pivotal hearing in a 2.1 billion-pound ($2.81 billion) lawsuit.

Regulators in Britain, Europe and the United States have separately begun examining Microsoft and others' practices in relation to cloud computing, Reuters reported.

Competition lawyer Maria Luisa Stasi is bringing the case on behalf of nearly 60,000 businesses that use Windows Server on rival cloud platforms, arguing Microsoft makes it more expensive there than on its own cloud computing service, Azure.

Stasi is asking London's Competition Appeal Tribunal to certify the case to proceed, an early step in the proceedings.

Microsoft, however, says Stasi's case does not set out a proper blueprint for how the tribunal will work out any alleged losses and should be thrown out.

MICROSOFT ACCUSED OF 'ABUSIVE STRATEGY'

Stasi's lawyer Sarah Ford told the tribunal that thousands of businesses had been overcharged because Microsoft charges higher prices to those who do not use Azure, making Azure a cheaper option than Amazon's AWS or the Google Cloud Platform.

She also said that "Microsoft degrades the user experience of Windows Server" on rival platforms, which Ford said was part of "a coherent abusive strategy to leverage Microsoft's dominant position" in the cloud computing market.

Microsoft argues that its vertically integrated business, where it uses Windows Server as an input for Azure while also licensing it to rivals, can benefit competition.

In July, an inquiry group from Britain's Competition and Markets Authority said Microsoft's licensing practices reduced competition for cloud services "by materially disadvantaging AWS and Google".

Microsoft said at the time that the group's report had ignored that "the cloud market has never been so dynamic and competitive".