Cerebras Launches AI Inference Tool to Challenge Nvidia

Cerebras Systems logo is seen in this illustration taken March 31, 2023. (Reuters)

Cerebras Systems launched on Tuesday a tool for AI developers that allows them to access the startup's outsized chips to run applications, offering what it says is a much cheaper option than industry-standard Nvidia processors.

Access to Nvidia graphics processing units (GPUs) - often via a cloud computing provider - to train and deploy large artificial intelligence models used for applications such as OpenAI's ChatGPT can be difficult to obtain and expensive. Running a deployed model is a process developers refer to as inference.

"We're delivering performance that cannot be achieved by a GPU," Cerebras CEO Andrew Feldman told Reuters in an interview. "We're doing it at the highest accuracy, and we're offering it at the lowest price."

The inference portion of the AI market is expected to be fast-growing and attractive - ultimately worth tens of billions of dollars if consumers and businesses adopt AI tools.

The Sunnyvale, California-based company plans to offer several types of the inference product via a developer key and its cloud. The company will also sell its AI systems to customers who prefer to operate their own data centers.

Cerebras' chips - each the size of a dinner plate and called Wafer Scale Engines - avoid one of the issues with AI data crunching: the data processed by the large models that power AI applications typically won't fit on a single chip and can require hundreds or thousands of chips strung together.

That means Cerebras' chips can achieve faster performance, Feldman said.

Cerebras plans to charge users as little as 10 cents per million tokens, which are one of the ways companies can measure the amount of output data from a large model.
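
To put the quoted price in concrete terms, the short Python sketch below estimates a bill from a token count. It assumes a flat rate of 10 cents per million tokens with no minimums or tiers, which the company has not confirmed; the function name and constant are hypothetical and are not part of any Cerebras API.

```python
from decimal import Decimal

# Assumption: a flat rate of USD 0.10 per million tokens, the lowest price
# quoted in the article. Actual pricing may vary by model or plan.
PRICE_PER_MILLION_TOKENS_USD = Decimal("0.10")


def estimate_cost_usd(tokens, price_per_million=PRICE_PER_MILLION_TOKENS_USD):
    """Return the estimated cost in US dollars for a number of tokens."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    # Scale the per-million price down to the actual token count.
    return price_per_million * Decimal(tokens) / Decimal(1_000_000)


if __name__ == "__main__":
    # Hypothetical workload of 250 million tokens.
    print(estimate_cost_usd(250_000_000))
```

Under that assumption, a workload of 250 million tokens would cost about $25.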

Cerebras is aiming to go public and filed a confidential prospectus with the Securities and Exchange Commission this month, the company said.



Alphabet to Roll out Image Generation of People on Gemini after Pause

A large Google logo is seen at Google's Bay View campus in Mountain View, California on August 13, 2024. (AFP)

Alphabet's Google said on Wednesday it has updated Gemini's AI image-creation model and would roll out the generation of visuals of people in the coming days, after a months-long pause of the capability.

In February, Google had paused its AI tool that creates images of people, following inaccuracies in some historical depictions generated by the model.

The issues, in which the AI model returned historical images that were sometimes inaccurate, drew flak from users.

The company said it has worked to improve the product, adhered to "product principles" and simulated situations to find weaknesses.

The feature will be made available first to paid users of the Gemini AI chatbot, starting in English, and will later be rolled out to more users and languages.

Google said it has improved the Imagen 3 model to create better images of people, but it would not generate images of specific people, children or graphic content.

OpenAI's Dall-E, Microsoft's Copilot and, more recently, xAI's Grok are among other AI tools that can now generate images.

The search engine giant also said that over the coming days, subscribers to Gemini Advanced, Business and Enterprise would be able to chat with "Gems", chatbots customized for specific purposes.

Users can write specific instructions for a particular purpose and create a Gem, saving them the time of rewriting prompts for repetitive use cases.