AI challenger Cerebras unveils 'pay-per-model' AI cloud service with Cirrascale, Jasper


Cerebras says a partnership with Cirrascale will give researchers access to clusters of AI machines at a price far below the cost of typical multi-year leases. Seen here, a bank of Cerebras racks for its CS2 dedicated AI computers.

Cerebras Systems

Artificial intelligence computer maker Cerebras Systems, which has built chips and computers and now makes supercomputers dedicated to speeding up deep learning, on Tuesday announced services to speed the use of very large language models, which are becoming increasingly popular not only for research but also for commercial use.


"We believe that large language models are under-hyped, not over-hyped," said Cerebras co-founder and CEO Andrew Feldman in a press briefing. "We are just beginning to see the impact of them; there will be winners and new emergents in each of three layers in the ecosystem, in the hardware layer, the infrastructure layer, and the application layer."

Feldman predicted, "Next year you will see a sweeping rise in the impact of large language models in various parts of the economy."

Partnering with cloud computing service provider Cirrascale, Cerebras is offering what it calls "pay-per-model" compute time, a flat rate to train to convergence a large language model such as OpenAI's GPT-3 on clusters of its CS2 computers designed for deep learning.


The service is branded as Cerebras AI Model Studio.

Prices, ranging from $2,500 to train a 1.3-billion-parameter model of GPT-3 in 10 hours to $2.5 million to train the 70-billion-parameter version in 85 days, are on average half the cost that users would pay to rent cloud capacity or lease machines for years to do the equivalent work. And the CS2 clusters can train models eight times as fast as clusters of Nvidia A100 machines in the cloud.
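To put the two quoted price points on a common footing, the flat rates can be divided by the quoted training times. The sketch below is only back-of-the-envelope arithmetic on the figures above; the service is sold as a flat rate per model, not by the hour, and the names `QUOTED` and `implied_hourly_rate` are illustrative, not part of any Cerebras or Cirrascale tooling.

```python
# Flat-rate prices and training times as quoted in the article.
# The dictionary layout and function name are illustrative assumptions.
QUOTED = {
    "GPT-3 1.3B": {"price_usd": 2_500, "hours": 10},
    "GPT-3 70B": {"price_usd": 2_500_000, "hours": 85 * 24},  # 85 days
}


def implied_hourly_rate(price_usd: float, hours: float) -> float:
    """Flat price divided by quoted wall-clock training time."""
    return price_usd / hours


for name, quote in QUOTED.items():
    rate = implied_hourly_rate(quote["price_usd"], quote["hours"])
    print(f"{name}: ${rate:,.2f} per hour of training time")
```

The smaller model works out to $250 per hour and the larger one to roughly $1,225 per hour, which is consistent with larger models being trained on larger, costlier clusters.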

Cirrascale is using a mix of clusters of its own CS2s and machines that Cerebras owns, as well as the Andromeda supercomputer, which is located at the colocation facilities of Santa Clara, California-based Colovore, where Cirrascale also has equipment installed.

Cerebras' price schedule in collaboration with Cirrascale promises to be half the average cost of cloud services or specialized clusters to train large models.

Cerebras Systems/Cirrascale

The Studio partnership follows one between Cerebras and Cirrascale announced a year ago to offer CS2 machines in the cloud on a weekly basis.

The service will automatically scale the size of clusters depending on the scale of the language model, said Feldman. The company emphasizes that training performance improves in linear proportion to adding more machines.

Scaling to the largest clusters would come at a premium, said Feldman. For example, Andromeda's 16-machine cluster is four times as large as a four-way CS2 cluster, but using it would probably cost a customer five times as much because it reaches a higher level of performance.
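Feldman's example can be read as a cost-per-unit-of-work comparison. The sketch below assumes, as the company emphasizes, that training throughput scales linearly with machine count, and takes the "five times" figure at face value even though Feldman gave it only as a rough estimate. The function name and parameters are illustrative.

```python
def relative_cost_per_unit_work(
    machines: int, relative_price: float, base_machines: int = 4
) -> float:
    """Cost per unit of training work relative to the base cluster.

    Assumes throughput is proportional to the number of machines
    (the linear scaling the company claims).
    """
    relative_throughput = machines / base_machines
    return relative_price / relative_throughput


# Andromeda: 16 machines at roughly 5x the price of a four-way cluster.
print(relative_cost_per_unit_work(machines=16, relative_price=5.0))
```

Under those assumptions the 16-machine cluster costs about 1.25 times as much per unit of training work as the four-way cluster, in exchange for finishing in about a quarter of the time.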


The most important immediate benefit of cutting the cost of large-model training may be to give access to large model development to parties that couldn't afford the sorts of enormous lease costs typically required, said Feldman.

"We've seen again and again that knowing pricing in advance, and the time it will take, are real issues for a whole class of customers, and we hope to overcome those issues," he said.

The alternative, said Feldman, is for companies to spend extensively to lease hardware for years at a time.


"If you think of the way the biggest models are being trained today, and they are all on dedicated clusters that are on several-year leases," said Feldman. "There are companies right now who have raised huge money and have tremendous valuations who in their wildest dreams have never owned hardware."


Also Tuesday, Cerebras announced that its Andromeda supercomputer, a cluster of 16 CS2 machines that it unveiled earlier this month, will be used by Jasper, a venture-backed startup that runs large language models as a service for business applications such as generating press releases and blog posts.

Jasper, which has nearly a hundred thousand paying customers for its generative text function, serves enterprises that need to train large language models with customer data, such as a particular knowledge base, product catalog, and corporate "voice."


"They want personalized models, and they want them badly," said Dave Rogenmoser, Jasper's CEO, in the same press briefing. The idea, he said, is to get the marketing department "all talking with the same voice" and for new hires to "get up to speed all speaking with the same voice" as the rest of the company. That includes things like a model generating Facebook ads using the customary language of the client.

The ability to cut the cost of training and dramatically speed up training time of large language models "is a huge draw for us" to working with Cerebras, said Rogenmoser.

Jasper recently closed on a Series A round valuing the company at $1.5 billion, said Rogenmoser.


Using the dedicated clusters can be not only faster and cheaper, but more nuanced, said Cerebras' head of product, Andy Hock, in the same press briefing.

"One of the things we observe more broadly in the market is that many companies would like to be able to quickly research and develop these large-scale models, but the infrastructure that exists in traditional cloud just doesn't make this kind of large-scale research and development easy," Hock said.

"Being able to ask questions like, should I train from scratch [a large language model], or should I fine-tune an open-source public check-point, what is the best answer, what is the most effective use of compute to lower the cost of goods to deliver the best service to my customers -- being able to ask those questions is costly and impractical in many cases of traditional infrastructure."

The Cerebras clusters enable Jasper and others to ask those questions, he said.

Both announcements were made on the occasion of the 36th annual Conference on Neural Information Processing Systems, or NeurIPS, the premier conference of the AI field, taking place this week in New Orleans.
