Robots Learn, Chatbots Visualize: How 2024 Will Be AI’s ‘Leap Forward’

Credit: Victor Arce
Credit: Victor Arce
TT

Robots Learn, Chatbots Visualize: How 2024 Will Be AI’s ‘Leap Forward’

Credit: Victor Arce
Credit: Victor Arce

By Cade Metz

New York - At an event in San Francisco in November, Sam Altman, the chief executive of the artificial intelligence company OpenAI, was asked what surprises the field would bring in 2024.

Online chatbots like OpenAI’s ChatGPT will take “a leap forward that no one expected,” Mr. Altman immediately responded.

Sitting beside him, James Manyika, a Google executive, nodded and said, “Plus one to that.”

The AI industry this year is set to be defined by one main characteristic: a remarkably rapid improvement of the technology as advancements build upon one another, enabling AI to generate new kinds of media, mimic human reasoning in new ways, and seep into the physical world through a new breed of robot.

In the coming months, AI-powered image generators like DALL-E and Midjourney will instantly deliver videos as well as still images. And they will gradually merge with chatbots like ChatGPT.

That means chatbots will expand well beyond digital text by handling photos, videos, diagrams, charts and other media. They will exhibit behavior that looks more like human reasoning, tackling increasingly complex tasks in fields like math and science. As the technology moves into robots, it will also help to solve problems beyond the digital world.

Many of these developments have already started emerging inside the top research labs and in tech products. But in 2024, the power of these products will grow significantly and be used by far more people.

“The rapid progress of AI will continue,” said David Luan, the chief executive of Adept, an AI start-up. “It is inevitable.”

OpenAI, Google and other tech companies are advancing AI far more quickly than other technologies because of the way the underlying systems are built.

Most software apps are built by engineers, one line of computer code at a time, which is typically a slow and tedious process. Companies are improving AI more swiftly because the technology relies on neural networks, mathematical systems that can learn skills by analyzing digital data. By pinpointing patterns in data such as Wikipedia articles, books, and digital text culled from the internet, a neural network can learn to generate text on its own.

Here’s a guide to how AI is set to change this year, beginning with the nearest-term advancements, which will lead to further progress in its abilities.

Instant Videos

Until now, AI-powered applications mostly generated text and still images in response to prompts. DALL-E, for instance, can create photorealistic images within seconds off requests like “a rhino diving off the Golden Gate Bridge.”

But this year, companies such as OpenAI, Google, Meta and the New York-based Runway are likely to deploy image generators that allow people to generate videos, too. These companies have already built prototypes of tools that can instantly create videos from short text prompts.

Tech companies are likely to fold the powers of image and video generators into chatbots, making the chatbots more powerful.

‘Multimodal’ Chatbots

Chatbots and image generators, originally developed as separate tools, are gradually merging. When OpenAI debuted a new version of ChatGPT last year, the chatbot could generate images as well as text.

AI companies are building “multimodal” systems, meaning the AI can handle multiple types of media. These systems learn skills by analyzing photos, text, and potentially other kinds of media, including diagrams, charts, sounds, and video, so they can then produce their own text, images, and sounds.

That isn’t all. Because the systems are also learning the relationships between different types of media, they will be able to understand one type of media and respond with another. In other words, someone may feed an image into chatbot and it will respond with text.

Better ‘Reasoning’

When Mr. Altman talks about AI’s taking a leap forward, he is referring to chatbots that are better at “reasoning” so they can take on more complex tasks, such as solving complicated math problems and generating detailed computer programs.

The aim is to build systems that can carefully and logically solve a problem through a series of discrete steps, each one building on the next. That is how humans reason, at least in some cases.

Leading scientists disagree on whether chatbots can truly reason like that. Some argue that these systems merely seem to reason as they repeat behavior they have seen in internet data. But OpenAI and others are building systems that can more reliably answer complex questions involving subjects like math, computer programming, physics, and other sciences.

“As systems become more reliable, they will become more popular,” said Nick Frosst, a former Google researcher who helps lead Cohere, an AI start-up.

If chatbots are better at reasoning, they can then turn into “AI agents.”

‘AI Agents’

As companies teach AI systems how to work through complex problems one step at a time, they can also improve the ability of chatbots to use software apps and websites on your behalf.

Researchers are essentially transforming chatbots into a new kind of autonomous system called an AI agent. That means the chatbots can use software apps, websites, and other online tools, including spreadsheets, online calendars, and travel sites. People could then offload tedious office work to chatbots. But these agents could also take away jobs entirely.

Chatbots already operate as agents in small ways. They can schedule meetings, edit files, analyze data, and build bar charts. But these tools do not always work as well as they need to. Agents break down entirely when applied to more complex tasks.

This year, AI companies are set to unveil agents that are more reliable. “You should be able to delegate any tedious, day-to-day computer work to an agent,” Mr. Luan said.

This might include keeping track of expenses in an app like QuickBooks or logging vacation days in an app like Workday. In the long run, it will extend beyond software and internet services and into the world of robotics.

Smarter Robots

In the past, robots were programmed to perform the same task over and over again, such as picking up boxes that are always the same size and shape. But using the same kind of technology that underpins chatbots, researchers are giving robots the power to handle more complex tasks — including those they have never seen before.

Just as chatbots can learn to predict the next word in a sentence by analyzing vast amounts of digital text, a robot can learn to predict what will happen in the physical world by analyzing countless videos of objects being prodded, lifted, and moved.

This year, AI will supercharge robots that operate behind the scenes, like mechanical arms that fold shirts at a laundromat or sort piles of stuff inside a warehouse. Tech titans like Elon Musk are also working to move humanoid robots into people’s homes.

The New York Times



Nvidia, Joining Big Tech Deal Spree, to License Groq Technology, Hire Executives

The Nvidia logo is seen on a graphic card package in this illustration created on August 19, 2025. (Reuters)
The Nvidia logo is seen on a graphic card package in this illustration created on August 19, 2025. (Reuters)
TT

Nvidia, Joining Big Tech Deal Spree, to License Groq Technology, Hire Executives

The Nvidia logo is seen on a graphic card package in this illustration created on August 19, 2025. (Reuters)
The Nvidia logo is seen on a graphic card package in this illustration created on August 19, 2025. (Reuters)

Nvidia has agreed to license chip technology from startup Groq and hire away its CEO, a veteran of Alphabet's Google, Groq said in a blog post on Wednesday.

The deal follows a familiar pattern in recent years where the world's biggest technology firms pay large sums in deals with promising startups to take their technology and talent but stop short of formally acquiring the target.

Groq specializes in what is known as inference, where artificial intelligence models that have already been trained respond to requests from users. While Nvidia dominates the market for training AI models, it faces much more competition in inference, where traditional rivals such as Advanced Micro Devices have aimed ‌to challenge it ‌as well as startups such as Groq and Cerebras Systems.

Nvidia ‌has ⁠agreed to a "non-exclusive" ‌license to Groq's technology, Groq said. It said its founder Jonathan Ross, who helped Google start its AI chip program, as well as Groq President Sunny Madra and other members of its engineering team, will join Nvidia.

A person close to Nvidia confirmed the licensing agreement.

Groq did not disclose financial details of the deal. CNBC reported that Nvidia had agreed to acquire Groq for $20 billion in cash, but neither Nvidia nor Groq commented on the report. Groq said in its blog post that it will continue to ⁠operate as an independent company with Simon Edwards as CEO and that its cloud business will continue operating.

In similar recent deals, Microsoft's ‌top AI executive came through a $650 million deal with a startup ‍that was billed as a licensing fee, and ‍Meta spent $15 billion to hire Scale AI's CEO without acquiring the entire firm. Amazon hired ‍away founders from Adept AI, and Nvidia did a similar deal this year. The deals have faced scrutiny by regulators, though none has yet been unwound.

"Antitrust would seem to be the primary risk here, though structuring the deal as a non-exclusive license may keep the fiction of competition alive (even as Groq’s leadership and, we would presume, technical talent move over to Nvidia)," Bernstein analyst Stacy Rasgon wrote in a note to clients on Wednesday after Groq's announcement. And Nvidia CEO Jensen Huang's "relationship with ⁠the Trump administration appears among the strongest of the key US tech companies."

Groq more than doubled its valuation to $6.9 billion from $2.8 billion in August last year, following a $750 million funding round in September.

Groq is one of a number of upstarts that do not use external high-bandwidth memory chips, freeing them from the memory crunch affecting the global chip industry. The approach, which uses a form of on-chip memory called SRAM, helps speed up interactions with chatbots and other AI models but also limits the size of the model that can be served.

Groq's primary rival in the approach is Cerebras Systems, which Reuters this month reported plans to go public as soon as next year. Groq and Cerebras have signed large deals in the Middle East.

Nvidia's Huang spent much of his biggest keynote speech of 2025 arguing that ‌Nvidia would be able to maintain its lead as AI markets shift from training to inference.


Italy Watchdog Orders Meta to Halt WhatsApp Terms Barring Rival AI Chatbots

The logo of Meta is seen at Porte de Versailles exhibition center in Paris, France, June 11, 2025. (Reuters)
The logo of Meta is seen at Porte de Versailles exhibition center in Paris, France, June 11, 2025. (Reuters)
TT

Italy Watchdog Orders Meta to Halt WhatsApp Terms Barring Rival AI Chatbots

The logo of Meta is seen at Porte de Versailles exhibition center in Paris, France, June 11, 2025. (Reuters)
The logo of Meta is seen at Porte de Versailles exhibition center in Paris, France, June 11, 2025. (Reuters)

Italy's antitrust authority (AGCM) on Wednesday ordered Meta Platforms to suspend contractual terms ​that could shut rival AI chatbots out of WhatsApp, as it investigates the US tech group for suspected abuse of a dominant position.

A spokesperson for Meta called the decision "fundamentally flawed," and said the emergence of AI chatbots "put a strain on our systems that ‌they were ‌not designed to support".

"We ‌will ⁠appeal," ​the ‌spokesperson added.

The move is the latest in a string by European regulators against Big Tech firms, as the EU seeks to balance support for the sector with efforts to curb its expanding influence.

Meta's conduct appeared capable of restricting "output, market ⁠access or technical development in the AI chatbot services market", ‌potentially harming consumers, AGCM ‍said.

In July, the ‍Italian regulator opened the investigation into Meta over ‍the suspected abuse of a dominant position related to WhatsApp. It widened the probe in November to cover updated terms for the messaging app's business ​platform.

"These contractual conditions completely exclude Meta AI's competitors in the AI chatbot services ⁠market from the WhatsApp platform," the watchdog said.

EU antitrust regulators launched a parallel investigation into Meta last month over the same allegations.

Europe's tough stance - a marked contrast to more lenient US regulation - has sparked industry pushback, particularly by US tech titans, and led to criticism from the administration of US President Donald Trump.

The Italian watchdog said it was coordinating with the European ‌Commission to ensure Meta's conduct was addressed "in the most effective manner".


Amazon Says Blocked 1,800 North Koreans from Applying for Jobs

Amazon logo (Reuters)
Amazon logo (Reuters)
TT

Amazon Says Blocked 1,800 North Koreans from Applying for Jobs

Amazon logo (Reuters)
Amazon logo (Reuters)

US tech giant Amazon said it has blocked over 1,800 North Koreans from joining the company, as Pyongyang sends large numbers of IT workers overseas to earn and launder funds.

In a post on LinkedIn, Amazon's Chief Security Officer Stephen Schmidt said last week that North Korean workers had been "attempting to secure remote IT jobs with companies worldwide, particularly in the US".

He said the firm had seen nearly a one-third rise in applications by North Koreans in the past year, reported AFP.

The North Koreans typically use "laptop farms" -- a computer in the United States operated remotely from outside the country, he said.

He warned the problem wasn't specific to Amazon and "is likely happening at scale across the industry".

Tell-tale signs of North Korean workers, Schmidt said, included wrongly formatted phone numbers and dodgy academic credentials.

In July, a woman in Arizona was sentenced to more than eight years in prison for running a laptop farm helping North Korean IT workers secure remote jobs at more than 300 US companies.

The scheme generated more than $17 million in revenue for her and North Korea, officials said.

Last year, Seoul's intelligence agency warned that North Korean operatives had used LinkedIn to pose as recruiters and approach South Koreans working at defense firms to obtain information on their technologies.

"North Korea is actively training cyber personnel and infiltrating key locations worldwide," Hong Min, an analyst at the Korea Institute for National Unification, told AFP.

"Given Amazon's business nature, the motive seems largely economic, with a high likelihood that the operation was planned to steal financial assets," he added.

North Korea's cyber-warfare program dates back to at least the mid-1990s.

It has since grown into a 6,000-strong cyber unit known as Bureau 121, which operates from several countries, according to a 2020 US military report.

In November, Washington announced sanctions on eight individuals accused of being "state-sponsored hackers", whose illicit operations were conducted "to fund the regime's nuclear weapons program" by stealing and laundering money.

The US Department of the Treasury has accused North Korea-affiliated cybercriminals of stealing over $3 billion over the past three years, primarily in cryptocurrency.